DYNAMICS OF THE STANDARD MODEL 


Describing the fundamental theory of particle physics and its applications, this 
book provides a detailed account of the Standard Model, focusing on techniques 
that can produce information about real observed phenomena. 

The book begins with a pedagogic account of the Standard Model, introducing 
essential techniques such as effective field theory and path-integral methods. It then 
focuses on the use of the Standard Model in the calculation of physical properties 
of particles. Rigorous methods are emphasized, but other useful models are also 
described. 

This second edition has been updated to include recent theoretical and exper- 
imental advances, such as the discovery of the Higgs boson. A new chapter is 
devoted to the theoretical and experimental understanding of neutrinos, and major 
advances in CP violation and electroweak physics have been given a modern treat- 
ment. This book is valuable to graduate students and researchers in particle physics, 
nuclear physics and related fields. This title, first published in 2014, has 
been reissued as an Open Access publication on Cambridge Core. 


JOHN F. DONOGHUE is Distinguished Professor in the Department of Physics, 
University of Massachusetts. His research spans particle physics, quantum field 
theory and general relativity. He is a Fellow of the American Physical Society. 


EUGENE GOLOWICH is Emeritus Professor in the Department of Physics, 
University of Massachusetts. His research has focused on particle theory and phe- 
nomenology. He is a Fellow of the American Physical Society and is a recipient of 
the College Outstanding Teacher award from the University of Massachusetts. 


BARRY R. HOLSTEIN is Emeritus Professor in the Department of Physics, 
University of Massachusetts. His research is in the overlap area of particle and 
nuclear theory. A Fellow of the American Physical Society, he is also the editor 
of Annual Reviews of Nuclear and Particle Science and is a longtime consulting 
editor of the American Journal of Physics. 


CAMBRIDGE MONOGRAPHS ON PARTICLE PHYSICS, NUCLEAR 
PHYSICS AND COSMOLOGY 


General Editors: T. Ericson, P. V. Landshoff 


Available titles in this series: 


3. E. Leader and E. Predazzi: An Introduction to Gauge Theories and Modern Particle Physics, 
Volume 1: Electroweak Interactions, the ‘New Particles’ and the Parton Model 

4. E. Leader and E. Predazzi: An Introduction to Gauge Theories and Modern Particle Physics, 
Volume 2: CP-Violation, QCD and Hard Processes 

6. H. Grosse and A. Martin: Particle Physics and the Schrödinger Equation 

7. B. Andersson: The Lund Model 

8. R. K. Ellis, W. J. Stirling and B. R. Webber: QCD and Collider Physics 

10. A. V. Manohar and M. B. Wise: Heavy Quark Physics 

11. R. Friihwirth, M. Regler, R. K. Bock, H. Grote and D. Notz: Data Analysis Techniques for 
High-Energy Physics, Second edition 

12. D. Green: The Physics of Particle Detectors 

13. V. N. Gribov and J. Nyiri: Quantum Electrodynamics 

14. K. Winter (ed.): Neutrino Physics, Second edition 

15. E. Leader: Spin in Particle Physics 

16. J. D. Walecka: Electron Scattering for Nuclear and Nucleon Structure 

17. S. Narison: QCD as a Theory of Hadrons 

18. J. F. Letessier and J. Rafelski: Hadrons and Quark-Gluon Plasma 

19. A Donnachie, H. G. Dosch, P. V. Landshoff and O. Nachtmann: Pomeron Physics and QCD 
20. A. Hofmann: The Physics of Synchrotron Radiation 

21. J. B. Kogut and M. A. Stephanov: The Phases of Quantum Chromodynamics 

22. D. Green: High Pr Physics at Hadron Colliders 

23. K. Yagi, T. Hatsuda and Y. Miake: Quark-Gluon Plasma 

24. D. M. Brink and R. A. Broglia: Nuclear Superfluidity 

25. F. E. Close, A. Donnachie and G. Shaw: Electromagnetic Interactions and Hadronic Structure 
26. C. Grupen and B. A. Schwartz: Particle Detectors, Second edition 

27. V. Gribov: Strong Interactions of Hadrons at High Energies 

28. I. I. Bigi and A. I. Sanda: CP Violation, Second edition 

29. P. Jaranowski and A. Królak: Analysis of Gravitational-Wave Data 

30. B. L. Ioffe, V. S. Fadin and L. N. Lipatov: Quantum Chromodynamics: Perturbative and 
Nonperturbative Aspects 

31. J. M. Cornwall, J. Papavassiliou and D. Binosi: The Pinch Technique and its Applications to 
Non-Abelian Gauge Theories 

32. J. Collins: Foundations of Perturbative QCD 

33. Y. V. Kovchegov and E. Levin: Quantum Chromodynamics at High Energy 

34. J. Rak and M. J. Tannenbaum: High-pT Physics in the Heavy Ion Era 

35. J. F. Donoghue, E. Golowich and B. R. Holstein: Dynamics of the Standard Model, 

Second edition 


DYNAMICS OF THE STANDARD 
MODEL 


SECOND EDITION 


JOHN F. DONOGHUE 


University of Massachusetts 


EUGENE GOLOWICH 


University of Massachusetts 


BARRY R. HOLSTEIN 


University of Massachusetts 


CAMBRIDGE 


UNIVERSITY PRESS 


CAMBRIDGE 


9 UNIVERSITY PRESS 


Shaftesbury Road, Cambridge CB2 8EA, United Kingdom 
One Liberty Plaza, 20th Floor, New York, NY 10006, USA 
477 Williamstown Road, Port Melbourne, VIC 3207, Australia 
314-321, 3rd Floor, Plot 3, Splendor Forum, Jasola District Centre, New Delhi — 110025, India 
103 Penang Road, #05-06/07, Visioncrest Commercial, Singapore 238467 


Cambridge University Press is part of Cambridge University Press & Assessment, 
a department of the University of Cambridge. 


We share the University’s mission to contribute to society through the pursuit of 
education, learning and research at the highest international levels of excellence. 


www.cambridge.org 
Information on this title: www.cambridge.org/9781009291002 


DOI: 10.1017/9781009291033 
© John F. Donoghue, Eugene Golowich and Barry R. Holstein 2022 


This work is in copyright. It is subject to statutory exceptions and to the provisions 
of relevant licensing agreements; with the exception of the Creative Commons version the 
link for which is provided below, no reproduction of any part of this work may take 
place without the written permission of Cambridge University Press. 


An online version of this work is published at doi-org/10.1017/9781009291033 under a 
Creative Commons Open Access license CC-BY-NC-ND 4.0 which permits re-use, 
distribution and reproduction in any medium for non-commercial purposes providing 
appropriate credit to the original work is given. You may not distribute derivative works 
without permission. To view a copy of this license, visit 
https://creativecommons.org/licenses/by-nc-nd/4.0 


All versions of this work may contain content reproduced under license from third parties. 
Permission to reproduce this third-party content must be obtained from these third-parties directly. 


When citing this work, please include a reference to the DOI 10.1017/9781009291033 


First published 2014 
Reissued as OA 2022 


A catalogue record for this publication is available from the British Library. 


ISBN 978-1-009-29100-2 Hardback 
ISBN 978-1-009-29101-9 Paperback 


Cambridge University Press & Assessment has no responsibility for the persistence 
or accuracy of URLs for external or third-party internet websites referred to in this 
publication and does not guarantee that any content on such websites is, or will remain, 
accurate or appropriate. 


To Lincoln Wolfenstein 


I 


Contents 


Preface to the second edition 


Inputs to the Standard Model 


I-1 


I-2 


14 


I-5 


Quarks and leptons 

Quarks 

Leptons 

Quark and lepton numbers 
Chiral fermions 

The massless limit 

Parity, time reversal, and charge conjugation 
Fermion mass 

Dirac mass 

Majorana mass 

Symmetries and near symmetries 
Noether currents 

Examples of Noether currents 
Approximate symmetry 

Gauge symmetry 

Abelian case 

Nonabelian case 

Mixed case 

On the fate of symmetries 
Hidden symmetry 


Spontaneous symmetry breaking in the sigma model 


Interactions of the Standard Model 


H-1 


Quantum Electrodynamics 
U(1) gauge symmetry 
QED to one loop 


vii 


page xvii 


me OO ON UvAnuULUvNEe = 


— 


N NNN BRR -e 
AUNE OWN AD W 


U” NNN 
= CO œ CO 


Vili 


lil 


IV 


I[-2 


II-3 


II-4 


Contents 


On-shell renormalization of the electric charge 
Electric charge as a running coupling constant 
Quantum Chromodynamics 

SU(3) gauge symmetry 

QCD to one loop 

Asymptotic freedom and renormalization group 
Electroweak interactions 

Weak isospin and weak hypercharge assignments 
SU(2), x U (l)y gauge-invariant lagrangian 
Spontaneous symmetry breaking 

Electroweak currents 

Fermion mixing 

Diagonalization of mass matrices 

Quark mixing 

Neutrino mixing 

Quark CP violation and rephasing invariants 


Symmetries and anomalies 


Iii-1 
I-2 


I-3 


TH-4 
H-5 


HI-—6 


Symmetries of the Standard Model 

Path integrals and symmetries 

The generating functional 

Noether’s theorem and path integrals 

The U(1) axial anomaly 

Diagrammatic analysis 

Path-integral analysis 

Classical scale invariance and the trace anomaly 
Chiral anomalies and vacuum structure 

The 0 vacuum 

The 0 term 

Connection with chiral rotations 

Baryon- and lepton-number violation in the Standard Model 


Introduction to effective field theory 


Iv-1 


IV-2 


Iv-3 


Effective lagrangians and the sigma model 
Representations of the sigma model 
Representation independence 

Integrating out heavy fields 

The decoupling theorem 

Integrating out heavy fields at tree level 
Matching the sigma model at tree level 
Loops and renormalization 


35 
37 
40 
40 
45 
51 
57 
57 
60 
62 
63 
66 
66 
68 
70 
72 


76 
76 
79 
80 
81 
82 
84 
88 
95 
98 
99 
101 
102 
103 


106 
106 
107 
109 
111 
111 
112 
114 
115 


VI 


IV-4 


IV-5 
IV-6 


IV-7 
IV-8 
IV-9 


Contents 


General features of effective field theory 
Effective lagrangians and symmetries 

Power counting and loops 

Weinberg’s power-counting theorem 

The limits of an effective field theory 
Symmetry breaking 

Matrix elements of currents 

Matrix elements and the effective action 
Effective field theory of regions of a single field 
Effective lagrangians in QED 

Effective lagrangians as probes of New Physics 


Charged leptons 


V-1 


The electron 

Breit—Fermi interaction 

QED corrections 

Precision tests of QED 

The infrared problem 

The muon 

Muon decay at tree level 
Precise determination of G,, 
Leading-order photonic correction 
The t lepton 

Exclusive leptonic decays 
Exclusive semileptonic decays 
Inclusive semileptonic decays 
Some applications of t decays 


Neutrinos 


VI-l 


VI-2 
VI-3 


VI—4 


Neutrino mass 


Equivalence of heavy Majorana mass to a dimension-five 


operator 
Lepton mixing 
Theory of neutrino oscillations 
Oscillations in vacuum 
Oscillations in matter: MSW effect 
CP violation 
Neutrino phenomenology 
Solar and reactor neutrinos: 0; and Am? 1 
Atmospheric and accelerator neutrinos: 623 and |Am3,| 
Short-baseline studies: 613 


ix 
119 
120 
121 
122 
123 
124 
126 
127 
129 
136 
138 


144 
144 
144 
148 
151 
153 
155 
155 
157 
159 
163 
164 
164 
167 
168 


173 
173 


175 
176 
179 
179 
181 
184 
185 
185 
190 
191 


Vil 


Vill 


Ix 


Contents 


VI-5 Testing for the Majorana nature of neutrinos 
VI-6  Leptogenesis 
VI-7 Number of light neutrino species 

Studies at the Z? peak 

Astrophysical data 


Effective field theory for low-energy QCD 
VU-1 QCD at low energies 

Vacuum expectation values and masses 

Quark mass ratios 

Pion leptonic decay, radiative corrections, and Fy 
VII-2 Chiral perturbation theory to one loop 

The order E* lagrangian 

The renormalization program 
VU-3 The nature of chiral predictions 

The pion form factor 

Rare processes 

Pion-—pion scattering 
VU-4 The physics behind the QCD chiral lagrangian 
VU-5 The Wess—Zumino—Witten anomaly action 
VII-6 The axial anomaly and m? > yy 


Weak interactions of kaons 
VUI-1 Leptonic and semileptonic processes 
Leptonic decay 
Kaon beta decay and Vus 
VUI-2 The nonleptonic weak interaction 
VUI-3 Matching to QCD at short distance 
Short-distance operator basis 
Perturbative analysis 
Renormalization-group analysis 
VIN-4 The AJ = 1/2 rule 
Phenomenology 
Chiral lagrangian analysis 
Vacuum saturation 
Nonleptonic lattice matrix elements 
VIII-5 Rare kaon decays 
Mass mixing and CP violation 
IX-1  K°—-K° mixing 
CP-conserving mixing 
IX-2 The phenomenology of kaon CP violation 


192 
195 
197 
197 
197 


200 
200 
201 
202 
205 
209 
210 
211 
215 
215 
219 
223 
225 
228 
233 


237 
237 
237 
238 
240 
242 
242 
243 
245 
249 
249 
252 
253 
254 
255 


260 
260 
263 
266 


XI 


IX-3 


IX—4 


Contents 


Kaon CP violation in the Standard Model 

Analysis of |e| 

Analysis of |e’| 

Chiral analysis of (€’/€) swe 

The strong CP problem 

The parameter 6 

Connections with the neutron electric dipole moment 


The N;' expansion 


X-1 


The nature of the large Ne limit 
Spectroscopy in the large N. limit 
Goldstone bosons and the axial anomaly 
The OZI rule 

Chiral lagrangians 


Phenomenological models 


XI-1 


XI-3 


XI—4 


Quantum numbers of OO and Q? states 
Hadronic flavor—spin state vectors 
Quark spatial wavefunctions 
Interpolating fields 

Potential model 

Basic ingredients 

Mesons 

Baryons 

Color dependence of the interquark potential 
Bag model 

Static cavity 

Spherical-cavity approximation 
Gluons in a bag 

The quark—gluon interaction 
Skyrme model 

Sine—Gordon soliton 

Chiral SU (2) soliton 

The Skyrme soliton 

Quantization and wavefunctions 
QCD sum rules 

Correlators 

Operator-product expansion 
Master equation 

Examples 


xi 


269 
270 
270 
271 
273 
273 
275 


278 
278 
280 
283 
285 
287 


291 
291 
291 
295 
297 
298 
298 
300 
301 
302 
303 
304 
304 
307 
308 
308 
309 
310 
312 
314 
318 
319 
321 
323 
324 


xii 


Contents 


XII Baryon properties 


XIII 


XIV 


XI-1 


XII-2 


XII-3 


XII —4 


XII-5 
XII-6 


Matrix-element computations 
Flavor and spin matrix elements 
Overlaps of spatial wavefunctions 
Connection to momentum eigenstates 
Calculations in the Skyrme model 
Electroweak matrix elements 
Magnetic moments 

Semileptonic matrix elements 
Symmetry properties and masses 
Effective lagrangians for baryons 
Baryon mass splittings and quark masses 
Goldberger—Treiman relation 

The nucleon sigma term 
Strangeness in the nucleon 

Quarks and nucleon spin structure 
Nuclear weak processes 
Measurement of Vaa 

The pseudoscalar axial form factor 
Hyperon semileptonic decay 
Nonleptonic decay 
Phenomenology 

Lowest-order chiral analysis 


Hadron spectroscopy 


XII-1 


XIU-2 


XIU-3 


XIN-4 


The charmonium and bottomonium systems 
Transitions in quarkonium 

Light mesons and baryons 

SU(6) classification of the light hadrons 
Regge trajectories 

SU(6) breaking effects 

The heavy-quark limit 

Heavy-flavored hadrons in the quark model 
Spectroscopy in the mg — œ limit 
Nonconventional hadron states 

The first resonance — 0 (440) 

Gluonia 

Additional nonconventional states 


Weak interactions of heavy quarks 


XIV-1 


Heavy-quark mass 


330 
330 
330 
332 
333 
336 
339 
339 
341 
343 
343 
344 
347 
348 
349 
351 
355 
355 
357 
359 
360 
360 
362 


366 
366 
312 
376 
376 
379 
381 
385 
385 
387 
390 
391 
394 
396 


399 
399 


XV 


XVI 


XIV-2 


XIV-3 


XIV—4 


XIV-5 
XIV—6 


XIV-7 


Contents 


Running quark mass 

The pole mass of a quark 

Inclusive decays 

The spectator model 

The heavy-quark expansion 

The top quark 

Exclusive decays in the heavy-quark limit 
Inclusive vs. exclusive models for b —> cev, 
Heavy Quark Effective Theory and exclusive decays 
B°—B° and D?— D’ mixing 

B?-B° mixing 

D?-D° mixing 

The unitarity triangle 

CP violation in B-meson decays 

CP-odd signals induced by mixing 
Decays to CP eigenstates 

Decays to non-CP eigenstates 
Semileptonic asymmetries 

CP-odd signals not induced by mixing 
Rare decays of B mesons 

The quark transition b > sy 

The decay B, > 4+8 


The Higgs boson 
XV-1 Introduction 
XV-2 Mass and couplings of the Higgs boson 


XV-3 


XV—4 


XV-5 
XV-6 


Higgs mass term 

The naturalness problem 

Higgs coupling constants 

Production and decay of the Higgs boson 
Decay 

Production 


Comparison of Standard Model expectations with LHC data 


Higgs contributions to electroweak corrections 
The corrections Ap and Ar 

Custodial symmetry 

The quantum Higgs potential and vacuum stability 
Two Higgs doublets 


The electroweak sector 


XVI-1 


Neutral weak phenomena at low energy 


xiii 
399 
401 
403 
403 
405 
408 
409 
410 
411 
416 
416 
418 
420 
421 
421 
423 
426 
427 
428 
430 
430 
432 


434 
434 
435 
436 
436 
437 
442 
442 
445 
447 
448 
449 
450 
452 
454 


458 
458 


XIV 


XVI-2 


XVI-3 


XVI—4 


XVI-5 


XVI-6 


Appendix A 
A-1 


A-3 


A-5 
A-6 


Contents 


Neutral-current effective lagrangians 

Deep-inelastic neutrino scattering from isoscalar targets 
Atomic parity violation in cesium 

Polarized Møller scattering 

Measurements at the Z? mass scale 

Decays of Z? into fermion-antifermion pairs 
Asymmetries at the Z? peak 

Definitions of the weak mixing angle 

Some W~ properties 


Decays of W~ into fermions 
Triple-gauge couplings 

The quantum electroweak lagrangian 

Gauge fixing and ghost fields in the electroweak sector 
A subset of electroweak Feynman rules 

On-shell determination of electroweak parameters 
Self-energies of the massive gauge bosons 

The charged gauge bosons W~ 


The neutral gauge bosons Z°, y 

Examples of electroweak radiative corrections 
The O(G,,m?) contribution to A, 

The O(G,,m?) contribution to Ar 

The Z— bb vertex correction 

Precision tests and New Physics 


Functional integration 
Quantum-mechanical formalism 
Path-integral propagator 
External sources 

The generating functional 

The harmonic oscillator 
Field-theoretic formalism 

Path integrals with fields 
Generating functional with fields 
Quadratic forms 

Background field method to one loop 
Fermion field theory 

Gauge theories 

Gauge fixing 

Ghost fields 


459 
461 
462 
463 
464 
466 
467 
469 
471 
471 
472 
474 
475 
476 
478 
479 
480 
482 
483 
484 
486 
488 
489 


493 
493 
493 
496 
497 
499 
502 
502 
503 
506 
507 
509 
512 
513 
516 


Contents 


Appendix B Advanced field-theoretic methods 
B-1 The heat kernel 
B-2 Chiral renormalization and background fields 
B-3 PCAC and the soft-pion theorem 
B44 Matching fields with different symmetry-transformation 
properties 
Appendix C Useful formulae 
C-1 Numerics 
C-2 Notations and identities 
C-3 Decay lifetimes and cross sections 
C4 Field dimension 
C5 Mathematics in d dimensions 
References 


Index 


XV 


520 
520 
524 
529 


532 


535 
535 
535 
538 
541 
541 


545 
567 


Preface to the second edition 


The Standard Model is the basis of our understanding of the fundamental inter- 
actions. At the present time, it remains in excellent agreement with experiment. It 
is clear that any further progress in the field will need to build on a solid under- 
standing of the Standard Model. Since the first edition was written in 1992 there 
have been major discoveries in neutrino physics, in CP violation, the discoveries 
of the top quark and the Higgs boson, and a dramatic increase in precision in both 
electroweak physics and in QCD. We feel that the present is a good moment to 
update our book, as the Standard Model seems largely complete. 

The opportunity to revise our book at this time has also enabled us to survey the 
progress since the first edition went to print. Besides the experimental discoveries 
that have taken place during these two decades, we have been impressed by the 
increase in theoretical sophistication. Many of the topics which were novel at the 
time of the first edition have now been extensively developed. Perturbative treat- 
ments have progressed to higher orders and new techniques have been developed. 
To cover all of these completely would require the expansion of many chapters 
into book-length treatments. Indeed, in many cases, entire new books dedicated to 
specialized topics have been published.! Our revision is meant as a coherent peda- 
gogic introduction to these topics, providing the reader with the basic background 
to pursue more detailed studies when appropriate. 

There has also been great progress on the possible New Physics which could 
emerge beyond the Standard Model — dark matter and dark energy, grand unifica- 
tion, supersymmetry, extra dimensions, etc. We are at a moment where this physics 
could emerge in the next round of experiments at the Large Hadron Collider (LHC) 
as well as in precision measurements at the intensity frontier. We look forward with 
great anticipation to the new discoveries of the next decade. 


! For example, see [BaP 99, Be 00, BiS 00, EISW 03, FuS 04, Gr 04, IoFL 10, La 10, Ma 04, MaW 07, Co 11] 
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xviii Preface to the second edition 


We thank our colleagues and students for feedback about the first edition of this 
book. A list of errata for the second edition will be maintained at the homepage 
of John Donoghue at the University of Massachusetts, Amherst. We encourage 
readers who find any mistakes in this edition to submit them to Professor Donoghue 
at donoghue @ physics.umass.edu. 


From the preface to the first edition 


The Standard Model lagrangian Lsm embodies our knowledge of the strong and 
electroweak interactions. It contains as fundamental degrees of freedom the spin 
one-half quarks and leptons, the spin one gauge bosons, and the spin zero Higgs 
fields. Symmetry plays the central role in determining its dynamical structure. The 
lagrangian exhibits invariance under SU (3) gauge transformations for the strong 
interactions and under SU (2) x U(1) gauge transformations for the electroweak 
interactions. Despite the presence of (all too) many input parameters, it is a mathe- 
matical construction of considerable predictive power. 

There are books available which describe in detail the construction of Lsm and 
its quantization, and which deal with aspects of symmetry breaking. We felt the 
need for a book describing the next steps, how £sm is connected to the observable 
physics of the real world. There are a considerable variety of techniques, of differ- 
ing rigor, which are used by particle physicists to accomplish this. We present here 
those which have become indispensable tools. In addition, we attempt to convey 
the insights and ‘conventional wisdom’ which have been developed throughout the 
field. This book can only be an introduction to the riches contained in the subject, 
hopefully providing a foundation and a motivation for further exploration by its 
readers. 

In writing the book, we have become all too painfully aware that each topic, 
indeed each specific reaction, has an extensive literature and phenomenology, and 
that there is a limitation to the depth that can be presented compactly. We empha- 
size applications, not fundamentals, of quantum field theory. Proofs of formal top- 
ics like renormalizability or the quantization of gauge fields are left to other books, 
as is the topic of parton phenomenology. In addition, the study by computer of 
lattice field theory is an extensive and rapidly changing discipline, which we do 
not attempt to cover. Although it would be tempting to discuss some of the many 
stimulating ideas, among them supersymmetry, grand unification, and string the- 
ory, which attempt to describe physics beyond the Standard Model, limitations of 
space prevent us from doing so. 

Although this book begins gently, we do assume that the reader already has some 
familiarity with quantum field theory. As an aid to those who lack familiarity with 
path-integral methods, we include a presentation, in Appendix A, which treats this 
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subject in an introductory manner. In addition, we assume a knowledge of the basic 
phenomenology of particle physics. 

We have constructed the material to be of use to a wide spectrum of readers 
who are involved with the physics of elementary particles. Certainly it contains 
material of interest to both theorist and experimentalist alike. Given the trend to 
incorporate the Standard Model in the study of nuclei, we expect the book to be 
of use to the nuclear physics community as well. Even the student being trained in 
the mathematics of string theory would be well advised to learn the role that sigma 
models play in particle theory. 

This is a good place to stress some conventions employed in this book. Chap- 
ters are identified with roman numerals. In cross-referencing equations, we include 
the chapter number if the referenced equation is in a chapter different from the 
point of citation. The Minkowski metric is g,,, = diag {1, —1, —1, —1}. Through- 
out, we use the natural units A = c = 1, and choose e > O so that the elec- 
tron has electric charge —e. We employ rationalized Heaviside—Lorentz units, and 
the fine-structure constant is related to the charge via a = e7/47r. The coupling- 
constants for the SU (3). x SU(2); x U(1) gauge structure of the Standard Model 
are denoted respectively as g3, g2, gı, and we employ coupling-constant phase 
conventions analogous to electromagnetism for the other abelian and nonabelian 
covariant derivatives of the Standard Model. The chiral projection operator for left- 
handed massless spin one-half particles is (1+ ys)/2, and in analyzing systems in d 
dimensions, we employ the parameter € = (4—d)/2. What is meant by the ‘Fermi 
constant’ is discussed in Sect. V—2. 


Amherst, MA, 2013 


I 
Inputs to the Standard Model 


This book is about the Standard Model of elementary particle physics. If we set 
the beginning of the modern era of particle physics in 1947, the year the pion was 
discovered, then the ensuing years of research have revealed the existence of a con- 
sistent, self-contained layer of reality. The energy range which defines this layer of 
reality extends up to about 1 TeV or, in terms of length, down to distances of order 
1071" cm. The Standard Model is a field-theoretic description of strong and electro- 
weak interactions at these energies. It requires the input of as many as 28 inde- 
pendent parameters.! These parameters are not explained by the Standard Model; 
their presence implies the need for an understanding of Nature at an even deeper 
level. Nonetheless, processes described by the Standard Model possess a remark- 
able insulation from signals of such New Physics. Although the strong interactions 
remain a calculational challenge, the Standard Model (generalized from its original 
form to include neutrino mass) would appear to have sufficient content to describe 
all existing data.” Thus far, it is a theoretical structure which has worked splendidly. 


I-1 Quarks and leptons 


The Standard Model is an SU(3) x SU(2) x U(1) gauge theory which is spon- 
taneously broken by the Higgs potential. Table I-1 displays mass determinations 
[RPP 12] of the Z? and W* gauge bosons, the Higgs boson H°, and the existing 
mass limit on the photon y. 

In the Standard Model, the fundamental fermionic constitutents of matter are the 
quarks and the leptons. Quarks, but not leptons, engage in the strong interactions 
as a consequence of their color charge. Each quark and lepton has spin one-half. 


l There are six lepton masses, six quark masses, three gauge coupling constants, three quark-mixing angles 
and one complex phase, three neutrino-mixing angles and as many as three complex phases, a Higgs mass 
and quartic coupling constant, and the QCD vacuum angle. 

2 Admittedly, at this time the sources of dark matter and of dark energy are unknown. 


2 Inputs to the Standard Model 


Table I-1. Boson masses. 


Particle Mass (GeV/c?) 
y <1x 10-2? 

wt 80.385 + 0.015 
z? 91.1876 + 0.0021 
H? 126.0 + 0.4 


Collectively, they display conventional Fermi—Dirac statistics. No attempt is made 
in the Standard Model either to explain the variety and number of quarks and lep- 
tons or to compute any of their properties. That is, these particles are taken at this 
level as truly elementary. This is not unreasonable. There is no experimental evi- 
dence for quark or lepton compositeness, such as excited states or form factors 
associated with intrinsic structure. 


Quarks 


There are six quarks, which fall into two classes according to their electrical charge 
Q. The u, c, t quarks have Q = 2e/3 and the d, s, b quarks have Q = — e/3, 
where e is the electric charge of the proton. The u, c, t and d, s, b quarks 
are eigenstates of the hamiltonian (‘mass eigenstates’). However, because they are 
believed to be permanently confined entities, some thought must go into properly 
defining quark mass. Indeed, several distinct definitions are commonly used. We 
defer a discussion of this issue and simply note that the values in Table I-2 provide 


Table I-2. The quarks. 


Flavor Mass“ (GeV/c?) Charge i S C B T 
u (255, x 1073 2e/3 1/2 0 0 0 0 
d (6.04772) x 1073 —e/3 —1/2 0 0 0 0 
s 0.105+9075 —e/3 0 -1 0 0 0 
c LTN 2e/3 0 0 1 0 0 
b A208) —e/3 0 0 0 -=i 0 
t 173.4 + 1.6 2e/3 0 0 0 0 1 


“The t-quark mass is inferred from top quark events. All others are determined in MS 
renormalization (cf. Sect. H—1) at scales mu,d,s(2 GeV/c’), mc(mc) and mp(mp) respec- 
tively. 
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Table I-3. The leptons. 


Flavor Mass (GeV / c?) Charge Le L un Ly 
Ve < 0.2 x 1078 0 1 0 0 
e 5.10998928(11)x 1074 —e 1 0 O 
Vu < 1.9 x 1074 0 0 1 0 
u 0.1056583715(35) —e 0 1 0 
Vr < 0.0182 0 0 0 1 
T 1.77682(16) —e 0 0 1 


an overview of the quark mass spectrum. A useful benchmark for quark masses is 
the energy scale Agcp(~ several hundred MeV) associated with the confinement 
phenomenon. Relative to Agcp, the u, d, s quarks are light, the b, t quarks are 
heavy, and the c quark has intermediate mass. The dynamical behavior of light 
quarks is described by the chiral symmetry of massless particles (cf. Chap. VI) 
whereas heavy quarks are constrained by the so-called Heavy Quark Effective 
Theory (cf. Sect. XIII-3). Each quark is said to constitute a separate flavor, i.e. 
six quark flavors exist in Nature. The s, c, b, t quarks carry respectively the 
quantum numbers of strangeness (S), charm (C), bottomness (B), and topness (T). 
The u, d quarks obey an SU(2) symmetry (isospin) and are distinguished by the 
three-component of isospin (/3). The flavor quantum numbers of each quark are 
displayed in Table I-2. 


Leptons 


There are six leptons which fall into two categories according to their electrical 
charge. The charged leptons e, u, t have Q = — e and the neutrinos ve, Vu, Vr 
have Q =0. Leptons are also classified in terms of three lepton types: electron 
(ve, e), muon (v,,, 4), and tau (v+, T). This follows from the structure of the charged 
weak interactions (cf. Sect. II-3) in which these charged-lepton/neutrino pairs are 
coupled to W= gauge bosons. Associated with each lepton type is a lepton number 
Le, La, Lı. Table I-3 summarizes lepton properties. 


At this time, there is only incomplete knowledge of neutrino masses. Information 
on the mass parameters m,,, m vu My, is obtained from their presence in various 
weak transition amplitudes. For example, the single beta decay experiment °H > 
3He+e~ +9, is sensitive to the mass m, . In like manner, one constrains the masses 
m,, and my, in processes such as m+ —> pt + v, and t —> 207 + nt +v 
respectively. Existing bounds on these masses are displayed in Table I-3. 
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It is known experimentally that upon creation the neutrinos {vy} = (Ve.Vp, Vr) 
will not propagate indefinitely but will instead mix with each other. This means that 
the basis of states {v,} cannot be eigenstates of the hamiltonian. Diagonalization 
of the leptonic hamiltonian is carried out in Sect. VI-2 and yields the basis {v;} = 
{v1, v2, v3} of mass eigenstates. Information on the neutrino mass eigenvalues 
m,,™M2,m3 is Obtained from neutrino oscillation experiments and cosmological 
studies. Oscillation experiments (cf. Sects. VI-3,VI-4) are sensitive to squared- 
mass differences.* Throughout the book, we adhere to the following relations, 


definition: Am;, = m? — m$, convention: m2 > mı. (1.1) 
From the compilation in [RPP 12], the squared-mass difference |Am3,| deduced 


from the study of atmospheric and accelerator neutrinos gives 
Ame) = 2.321912 x 107° eV?, (1.2a) 


whereas data from solar and reactor neutrinos imply a squared-mass difference 
roughly 31 times smaller, 


Am}, = (7.50 + 0.20) x 107° eV’. (1.2b) 


Thus the neutrinos v; and v form a quasi-doublet. One speaks of a normal or 
inverted neutrino mass spectrum, respectively, for the cases* 


normal: m3 > M12, inverted: mı 2 > m3. (1.2c) 


Since the largest neutrino mass Migs, be it m2 or m3, cannot be lighter than the 
mass splitting of Eq. (1.2), we have the bound migs > 0.049 eV. Finally, a com- 
bination of cosmological inputs can be employed to bound the neutrino mass sum 
ae ı m;i, the precise bound depending on the chosen input data set. In one exam- 
ple [deP et al. 12], photometric redshifts measured from a large galaxy sample, cos- 
mic microwave background (CMB) data and a recent determination of the Hubble 
parameter are used to obtain the bound 


mı +m +m, < 0.26eV, (1.3a) 


whereas data from the CMB combined with that from baryon acoustic oscillations 
yields [Ad et al. (Planck collab.) 13] 


mı + m + m < 0.23 eV. (1.3b) 
A further discussion of the neutrino mass spectrum appears in Sect. VI—4. 
3 Only two of the mass differences can be independent, so Am‘, + Am}, + Am, =0. 
4 There is also the possibility of a quasi-degenerate neutrino mass spectrum (mı ~ m2 ~ m3), which can be 


thought of as a limiting case of both the normal and inverted cases in which the individual masses are 
sufficiently large to dwarf the |Am3,| splitting. 
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Quark and lepton numbers 


Individual quark and lepton numbers are known to be not conserved, but for dif- 
ferent reasons and with different levels of nonconservation. Individual quark num- 
ber is not conserved in the Standard Model due to the charged weak interactions 
(cf. Sect. II-3). Indeed, quark transitions of the type q; —> q; + W= induce the 
decays of most meson and baryon states and have led to the phenomenology of 
Flavor Physics. Individual lepton number is not conserved, as evidenced by the 
observed vy <> vg (a, B =e, u, T) oscillations. This source of this phenomenon is 
associated with nonzero neutrino masses. There is currently no additional evidence 


for the violation of individual lepton number despite increasingly sensitive limits 
such as the branching fraction B,,--..-e-e+ < 1.0 x 107". 

Existing data are consistent with conservation of total quark and total lepton 
number, e.g. the proton lifetime bound t, > 2.1 x 10? yr [RPP 12] and the nuclear 
half-life limit £}; ['2°Xe] > 1.6 x 10% yr [Ac et al. (EXO-200 collab.) 11]. These 
conservation laws are empirical. They are not required as a consequence of any 
known dynamical principle and in fact are expected to be violated by certain non- 
perturbative effects within the Standard Model (associated with quantum tunneling 
between topologically inequivalent vacua — see Sect. I—6). 


I—2 Chiral fermions 


Consider a world in which quarks and leptons have no mass at all. At first, this 
would appear to be a surprising supposition. To an experimentalist, mass is the 
most palpable property a particle has. It is why, say, a muon behaves differently 
from an electron in the laboratory. Nonetheless, the massless limit is where the 
Standard Model begins. 


The massless limit 


Let w(x) be a solution to the Dirac equation for a massless particle, 
ipw=0. (2.1) 


We can multiply this equation from the left by ys and use the anticommutativity of 
ys with y” to obtain another solution, 


iĝ ysy = 0. (2.2) 


We superpose these solutions to form the combinations 


1 1 
Pr = Te VES (2.3) 
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where ‘1’ represents the unit 4 x 4 matrix. The quantities Yz and Wr are solutions 
of definite chirality (i.e. handedness). For a massless particle moving with precise 
momentum, these solutions correspond respectively to the spin being anti-aligned 
(left-handed) and aligned (right-handed) relative to the momentum. In other words, 
chirality coincides with helicity for zero-mass particles. The matrices Mr =(1+ 
ys)/2 are chirality projection operators. They obey the usual projection operator 
conditions under addition, 


Pp +Tr=1, (2.4) 
and under multiplication, 
rp, =T,, rRIR=TR, rrr =FR} = 0. (2.5) 


In the massless limit, a particle’s chirality is a Lorentz-invariant concept. For 
example, a particle which is left-handed to one observer will appear left-handed to 
all observers. Thus chirality is a natural label to use for massless fermions, and a 
collection of such particles may be characterized according to the separate numbers 
of left-handed and right-handed particles. 

It is simple to incorporate chirality into a lagrangian formalism. The lagrangian 
for a massless noninteracting fermion is 


L=iw py, (2.6) 
or in terms of chiral fields, 
L= Lrt La, (2.7) 
where 
Li r =i RÊ WLR- (2.8) 
The lagrangians £z, g are invariant under the global chiral phase transformations 
WL rR) > exp(—iær R) YL, R), (2.9) 


where the phases a; rz are constant and real-valued but otherwise arbitrary. Antici- 
pating the discussion of Noether’s theorem in Sect. I—4, we can associate conserved 
particle-number current densities J a R> 


Jee =Virv' via Oir =9), (2.10) 


with this invariance. From these chiral current densities, we can construct the vector 
current V” (x), 


Vt = JE + Jk (2.11) 


and the axial-vector current A” (x), 
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AY agi = ds. (2.12) 

Chiral charges Qz, r are defined as spatial integrals of the chiral charge densities, 
Orn) = f ax Jf gl) 2.13) 


and represent the number operators for the chiral fields Yz, R. They are time- 
independent if the chiral currents are conserved. One can similarly define the vector 
charge Q and the axial-vector charge Qs, 


Qt) = fas V tx), Qs(t) = [as A°(x). (2.14) 
The vector charge Q is the total number operator, 
Q = Qr+ QL, (2.15) 
whereas the axial-vector charge is the number operator for the difference 
Os = Qr — Qr. (2.16) 


The vector charge Q and axial-vector charge Qs simply count the sum and differ- 
ence, respectively, of the left-handed and right-handed particles. 


Parity, time reversal, and charge conjugation 


The field transformations of Eq. (2.9) involve parameters œz rp which can take on 
a continuum of values. In addition to such continuous field mappings, one often 
encounters a variety of discrete transformations as well. Let us consider the oper- 
ations of parity 


x = (x9, x) > xp = (x?, —x), (2.17) 
and of time reversal 
x = (x°, x) > xr = (—x°, x), (2.18) 


as defined by their effects on spacetime coordinates. The effect of discrete trans- 
formations on a fermion field y(x) will be implemented by a unitary operator P 
for parity and an antiunitary operator T for time reversal. In the representation of 
Dirac matrices used in this book, we have 


Py(x)P =y paer), TYT =iy'y y (xr). (2.19) 


An additional operation typically considered in conjunction with parity and time 
reversal is that of charge conjugation, the mapping of matter into antimatter, 


CYC! = iy yh (x), (2.20) 


8 Inputs to the Standard Model 


Table I-4. Response of Dirac bilinears to 
discrete mappings. 


C P T 
S(x) S(xp) S(xr) 
P(x) —P(xp) —P(xr) 
=J) Ju (xP) Ju (xr) 
JE (x) — Jsu (xP) Jsu (xr) 
=T" (x) Tv (xp) —Tyw (xr) 


where Vp = yj Yop (a, B=1,...,4). The spacetime coordinates of field w(x) are 
unaffected by charge conjugation. 

In the study of discrete transformations, the response of the normal-ordered 
Dirac bilinears 


sœ) =: WO)W): P(x) =: W@)ysb (x) : 
Jœ) =: Yæ)" wr): JE (x) =: Yy" ys a) : (2.21) 
T(x) =: (xo W(x) : 


is of special importance to physical applications. Their transformation properties 
appear in Table I-4. Close attention should be paid there to the location of the 
indices in these relations. Another example of a field’s response to these discrete 
transformations is that of the photon A“ (x), 


C A#(x) C7! c = —A*(x), P A" (x) P! = A, (xp), Ho) 
T A*(x) T! c= A,y(xr). mA 
Beginning with the discussion of Noether’s theorem in Sect. 1—4, we shall explore 
the topic of invariance throughout much of this book. It suffices to note here that 
the Standard Model, being a theory whose dynamical content is expressed in terms 
of hermitian, Lorentz-invariant lagrangians of local quantum fields, is guaranteed 
to be invariant under the combined operation CPT. Interestingly, however, these 
discrete transformations are individually symmetry operations only of the strong 
and electromagnetic interactions, but not of the full electroweak sector. We see 
already the possibility for such behavior in the occurrence of chiral fermions wz, r, 
since parity maps the fields wz, into each other, 


Wier > P Yr r(x) Po! = y’ YR (p). (2.23) 
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Thus any effect, like the weak interaction, which treats left-handed and right- 
handed fermions differently, will lead inevitably to parity-violating phenomena. 


I-3 Fermion mass 


Although the discussion of chiral fermions is cast in the limit of zero mass, fermions 
in Nature do in fact have nonzero mass and we must account for this. In a lagran- 
gian, a mass term will appear as a hermitian, Lorentz-invariant bilinear in the fields. 
For fermion fields, these conditions allow realizations referred to as Dirac mass and 
Majorana mass.” 


Dirac mass 


The Dirac mass term for fermion fields yr involves the bilinear coupling of fields 
with opposite chirality 


-Lp = mpl YL Yr + YrYL]=mp vw (3.1) 


where YW = Wr + Wr and mp is the Dirac mass. The Dirac mass term is invari- 
ant under the phase transformation w(x) — exp(—ia)y(x) and thus does not 
upset conservation of the vector current V“ = Yy” y and the corresponding num- 
ber fermion operator Q of Eq. (2.15). All fields in the Standard Model, save pos- 
sibly for the neutrinos, have Dirac masses obtained from their interaction with the 
Higgs field (cf. Sects. II-3, I—4). Although right-handed neutrinos have no cou- 
plings to the Standard Model gauge bosons, there is no principle prohibiting their 
interaction with the Higgs field and thus generating neutrino Dirac masses in the 
same manner as the other particles. 


Majorana mass 


A Majorana mass term is one which violates fermion number by coupling two 
fermions (or two antifermions). In the Majorana construction, use is made of the 
charge-conjugate fields, 


Y= Cy’, bir) = Try), (3.2) 
where C is the charge-conjugation operator, obeying 
C=-C!'=-C'=-C". (3.3) 


In the Dirac representation of gamma matrices (cf. App. C), one has C =iy7y°. 
Some useful identities involving w° include 


5 We suppress spacetime dependence of the fields in this section. 
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WD vj HV Cy, Wi pi = Wi" C y}, 
ee ae 
(Dv) =r Whew) =- ew, ay 
VW) Yi = W Vi. YW) yt Wi = -v y" Yi, 
(Wr) WL =0, (Wr) V" Wr =Q. 


The two identities in the bottom line follow from r RCT; =0. 

The possibility of a Majorana mass term follows from the fact that a combination 
of two fermion fields YC w is an invariant under Lorentz transformations. Two 
equivalent expressions for a Majorana mass term involving chiral fields Yz, g are® 


—Ly = Si [Ow Wir + VLR Wre] 
(3.5) 
= TE [pr Cyr — Via) Chi e]. 


Because the cross combination (Wr)! Cy, =0, the Majorana mass terms involves 
either two left-chiral fields or two right-chiral fields, and the left-chiral and right- 
chiral masses are independent. Treating y and y* as independent variables, the 
resulting equations of motion are 


iĝ Yr — mpr YR =O, i WR — mpr Wr =Q, (3.6) 


with a similar set of equations for yw ,. Iteration of these coupled equations shows 
that mp indeed behaves as a mass. 

A Majorana mass term clearly does not conserve fermion number and mixes 
the particle with its antiparticle. Indeed, a Majorana fermion can be identified with 
its own antiparticle. This can be seen, using wr as an example, by rewriting the 
lagrangian in terms of the self-conjugate field 


1 
Yu = W [Wr T YR] , (3.7) 
which, given the equations of motion above, will clearly satisfy the Dirac equa- 
tion. The total Majorana lagrangian can be simply rewritten in terms of this self- 
conjugate field as 


Ler + Li = Vaid Wr- E [UR ve + Ve We) | 


= Wyip Wu — MRY m Wm (3.8) 
= Y Ci Ym — mr YC Ym, 


6 The factor of 1 /2 with the Majorana mass parameters m m compensates for a factor of 2 encountered in 
taking the matrix element of the Majorana mass term. 
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where again the identity [pg CI’; =0 plays a role in this construction. To avoid 
the possibility of nonconservation of charge, any Majorana mass term must be 
restricted to a field with is neutral under the gauge charges. Thus, among the par- 
ticles of the Standard Model, we will see that only right-handed neutrinos satisfy 
this condition. 

Finally, we note that a Dirac field can be written as two Majorana fields with 
opposite masses via the construction 


ol 1 


Ya = 5 [Vr +v]: We 5 [vi — ve]. (3.9) 
in which case we find 
-Lp = mp |Y, Yr + Yr Y] 
mp —— mp —— (3.10) 
=r (WE) Wa — > (We) Wo + h.c. 


The apparent violation of lepton number that looks like it would arise from this 
framework does not actually occur, because the effects proportional to the mass 
of these fields will cancel due to the minus sign between the two mass terms. To 
make matters look even more puzzling, we can flip the sign on the mass term for 
the second field, by the field redefinition Yp — i Wp, in which case both masses 
appear positive. However in this case, the weak current would pick up an unusual 
factor of i, since the left-handed field would then become 


1 
Wi > [i o+ wal. (3.11) 


In this case, potential lepton-number violating processes would cancel between the 
two fields because of the occurrence of a factor of i? = — 1 from the application 
of the weak currents. These algebraic gymnastics become more physically relevant 
when we combine both Dirac and Majorana mass terms in Chap. VI. 


I-4 Symmetries and near symmetries 


A symmetry is said to arise in Nature whenever some change in the variables of 
a system leaves the essential physics unchanged. In field theory, the dynamical 
variables are the fields, and symmetries describe invariances under transformations 
of the fields. For example, one associates with the spacetime translation x, > x, + 
a, a transformation of the field y(x) to Y (x + a). In turn, the ‘essential physics’ 
is best described by an action, at least in classical physics. If the action is invariant, 
the equations of motion, and hence the classical physics, will be unchanged. The 
invariances of quantum physics are identified by consideration of matrix elements 
or, equivalently, of the path integral. We begin the study of symmetries here by 
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exploring several lagrangians which have invariances and by considering some of 
the consequences of these symmetries. 


Noether currents 


The classical analysis of symmetry focusses on the lagrangian, which in general 
is a Lorentz-scalar function of several fields, denoted by g;, and their first deriva- 
tives 99i, i.e. L = L(G;, 3 pi). Noether’s theorem states that for any invariance of 
the action under a continuous transformation of the fields, there exists a classical 
charge Q which is time-independent (O = 0) and is associated with a conserved 
current, 0,,J“ =0. This theorem covers both internal and spacetime symmetries. 
For most’ internal symmetries, the lagrangian is itself invariant. Given a continu- 
ous field transformation, one can always consider an infinitesimal transformation 


G(x) = pi Œ) + fi), (4.1) 


where e€ is an infinitesimal parameter and f;(~) is a function of the fields in the 
theory. The procedure for constructing the Noether current of an internal symmetry 
is to temporarily let € become a function of x and to define the quantity 


ĝia) = Gi (x) + €(x) fil), (4.2) 


such that in the restriction back to constant €, £ becomes invariant and ¢;(x) > 
g; (x). For an internal symmetry, the Noether current is then defined by 


0 
J"(x) = —— LG, a9). (4.3) 
3 (ðE (x) 
Use of the equation of motion together with the invariance of the lagrangian 
under the transformation in Eq. (4.1) yields 3, J” = ƏL/ðe (x) =0 as desired. The 
Noether charge Q = f d?x Jo is time-independent if the current vanishes suffi- 
ciently rapidly at spatial infinity, i.e. 


dQ _ 
a 
We refer the reader to field theory textbooks for further discussion, including the 
analogous procedure for constructing Noether currents of spacetime symmetries. 
Identifying the current does not exhaust all the consequences of a symmetry but 
is merely the first step towards the implementation of symmetry relations. Notice 
that we have been careful to use the word ‘classical’ several times. This is because 
the invariance of the action is not generally sufficient to identify symmetries of a 
quantum theory. We shall return to this point. 


d°x AJo= — J d’xV.-J=0. (4.4) 


7 An exception occurs for the so-called topological gauge symmetries. 
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Examples of Noether currents 


Let us now consider some explicit field theory models in order to get practice in 
constructing Noether currents. 


(i) Isospin symmetry: SU (2) isospin invariance of the nucleon—pion system pro- 
vides a standard and uncomplicated means for studying symmetry currents. Con- 
sider a doublet of nucleon fields 

Y= (2) , (4.5) 
n 


and a triplet of pion fields m = {xŻ} (i = 1, 2, 3) with lagrangian 
Ee 1 2 ooo. pe À 2 
L=Ņ (ip -m) y +5 [ður dx -m T -x| +igyt-mysy — gE 
(4.6) 
where m is the nucleon mass matrix 
m=- (” 0 
0m 
and t = {tŻ }(i = 1, 2, 3) are the three Pauli matrices. This lagrangian is invariant 
under the global SU (2) rotation of the fields 
vow =Uy, U =exp(-it-«/2) (4.7) 
for any a', (i = 1, 2, 3) provided the pion fields are transformed as 
txt n =Ut- nul. (4.8) 


In proving this, it is useful to employ the identity 
1 
A eee) (4.9) 


from which we easily see that mimi is invariant under the transformation of 
Eq. (4.8). The response of the individual pion components to an isospin transfor- 
mation can be found from multiplying Eq. (4.8) by t’ and taking the trace, 


m'i = Ri (æn, Ri (a) = sir (Uru): (4.10) 


To determine the isospin current, one considers the spacetime-dependent trans- 
formation with a now infinitesimal, 


ý = (1 — it a(x)/2) Y, Rİ = ni — eR niat (x). (4.11) 
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Performing this transformation on the lagrangian gives 
ae jia 2 F 
LO, R) = Lp, 2) + SW" t : apay — e Our yato, (4.12) 


and applying our expression Eq. (4.3) for the current yields the triplet of currents 
(one for each a;) 


i ~ ot! ijk j k 
v= Vz ¥ FHE T 0,7". (4.13) 


By use of the equations of motion for y and z, it is straightforward to verify that 
this current is conserved. 

Gi) The linear sigma model: With a few modifications the above example 
becomes one of the most instructive of all field theory models, the sigma model 
[GeL 60]. One adds to the lagrangian of Eq. (4.6) a scalar field o with judiciously 
chosen couplings, and removes the bare nucleon mass, 


= 1 1 
L= Vip + 5a. - On + 50,09" 
> y (4.14) 
— gh 0 -it ny) y+ (0? +m?) - 3H). 


For u? > 0, the model exhibits the phenomenon of spontaneous symmetry break- 
ing (cf. Sect. I-6). In describing the symmetries of this lagrangian, it is useful to 
rewrite the mesons in terms of a matrix field 


D=o +it-n, (4.15) 
such that 
o+n = T (ZE). (4.16) 
Then we obtain 


L= Vipin + Vaip ve + Tr (LEE) 
1 5, : (4.17) 
4 nA Tr (£E) — w (E'E) — g (PLEWYr + PRE YL), 


where wWz.r are chiral fields (cf. Eq. (2.3)). The left-handed and right-handed 
fermion fields are coupled together only in the interaction with the & field. The 
purely mesonic portion of the lagrangian is obviously invariant under rotations 
among the ø, x fields. The full lagrangian has separate ‘left’ and ‘right’ invari- 
ances, i.e. SU(2); x SU (2)pr, 


Wir > Wi r = ULRVL..R: E > Y'= ULU}, (4.18) 


with Uz and Up being arbitrary SU (2) matrices, 
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UL r = exp (—ia; r: T/2). (4.19) 


The fermion portions of the transformation clearly involve just the SU (2) isospin 
rotations on the left-handed and right-handed fermions. However, the mesons 
involve a combination of a pure isospin rotation among the x fields together with 
a transformation between the o and x fields 


o> = ST (uLu}) o+ iT (rT) xk 
Mo + 5 (ærer) T, 
teonk= =i (tuu) o+ ST (Auru) ad 
~rt — : (a, — a) o — oe (a7 +a), (4.20) 


where the second form in each case is for infinitesimal œz, œg. For each invariance 
there is a separate conserved current 


OOE i ; 
Jin = VLYu z VL = git (c* (na, 2! — 3,2 z")) 
7 ái 1 k k 1 klem £ m 
= WLY uy VL z = ( 0,0" — T 0.0) + z£ M OT, 


2 (4.21) 


k . 
Jku = Davy Vr + st (c* (Ə E E — D1a,5)) 


= . 7 1 1 
— VRYu g ÝR + 7 (oð 7t = zð) + zen apn". 
These can be formed into a conserved vector current 
k 
= T 
Vi = Jin + Jiu = Pyu Y+ ee gn", (4.22) 


which is just the isospin current derived previously, and a conserved axial-vector 
current 


A =J -Jk xy a ka o — 09,0" 4.23 
u “Ly Ru — My +n pO — OTT. (4.23) 


(iii) Scale invariance: Our third example illustrates the case of a spacetime trans- 
formation in which the lagrangian changes by a total derivative. Consider classical 
electrodynamics (cf. Sect. H—1) but with a massless electron, 


1 _ 
L= FFF + 0 ipy, (4.24) 


where w is the electron field, D, = (0, + ieA,)w is the covariant derivative of 
w, A, is the photon field, and F,» is the electromagnetic field strength. We shall 
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describe the construction of both D, w and Fu» in the next section. Since there 
are no dimensional parameters in this lagrangian, we are motivated to consider 
the effect of a change in coordinate scale x — x’= àx together with the field 
transformations 


W(x) > W(x) = VP WAX), Ay (x) > Al Œ) =AA, (Ax). (4.25) 
Although the lagrangian itself is not invariant, 
L(x) > L(x) = MLA), (4.26) 


with a change of variable the action is easily seen to be unchanged, 
s= | trew > f ateacox= fax ca’) =. (4.27) 


There is nothing in this classical theory which depends on how length is scaled. 
The Noether current associated with the change of scale is 


I=. (4.28) 


scale 


where 0”” is the energy-momentum tensor of the theory, 


1 = | o- 
0” = =g" pF a T iipy | — F" F”, AN + svt Oy. 
(4.29) 


Since the energy-momentum tensor is itself conserved, 9 ,0”” = 0, the conservation 
of scale current is equivalent to the vanishing of the trace of the energy-momentum 
tensor, 


nde 


scale ~ 


6 =0. (4.30) 


This trace property may be easily verified using the equations of motion. 


Approximate symmetry 


Thus far, we have been describing exact symmetries. Symmetry considerations 
are equally useful in situations where there is ‘almost’ a symmetry. The very 
phrase ‘approximate symmetry’ seems self-contradictory and needs explanation. 
Quite often a lagrangian would have an invariance if certain of the parameters in it 
were set equal to zero. In that limit the invariance would have a set of physical con- 
sequences which, with the said parameters being nonzero, would no longer obtain. 
Yet, if the parameters are in some sense ‘small’, the predicted consequences are 
still approximately valid. In fact, when the interaction which breaks the symmetry 
has a well-defined behavior under the symmetry transformation, its effect can gen- 
erally be analyzed in terms of the basis of unperturbed particle states by using the 
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Wigner—Eckart theorem. The precise sense in which the symmetry-breaking terms 
can be deemed small depends on the problem under consideration. In practice, the 
utility of an approximate symmetry is rarely known a priori, but is only evident 
after its predictions have been checked experimentally. 

If a symmetry is not exact, the associated currents and charges will no longer 
be conserved. For example, in the linear sigma model, the symmetry is partially 
broken if we add to the lagrangian a term of the form 

Lacs IN (4.31) 
where & is the matrix defined in Eq. (4.15). With this addition, the vector isospin 
SU (2) symmetry remains exact but the axial SU (2) transformation is no longer an 
invariance. The axial-current divergence becomes 


OA =n, (4.32) 
and the charge is time-dependent, 
dQ! i 
L za faxr. (4.33) 
dt 


In the linear sigma model, if the parameters g, à are of order unity it is clear that 
the perturbation is small provided 1 >> a/p’, as n is the only other mass scale in 
the theory. However, if either g or A happens to be anomalously large or small, the 
condition appropriate for a ‘small’ perturbation is not a priori evident. 

In our example (iii) of scale invariance in massless fermion electrodynamics, the 
addition of an electron mass 


Lmass = -myy (4.34) 
would explicitly break the symmetry and the trace would no longer vanish, 
0t =myýyy £0. (4.35) 


This is in fact what occurs in practice. Fermion mass is typically not a small param- 
eter in QED and cannot be treated as a perturbation in most applications. 


I-5 Gauge symmetry 


In our discussion of chiral symmetry, we considered the effect of global phase 
transformations, Wy R(x) > exp (—idz.r)Wz.r(x). Global phase transformations 
are those which are constant throughout all spacetime. Let us reconsider the system 
of chiral fermions, but now insist that the phase transformations be local. Each 
transformation is then labeled by a spacetime-dependent phase ay, r(x), 


Wr, R(x) > exp(—iay r(X)) YL RX). (5.1) 
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Such local mappings are referred to as gauge transformations. The free massless 
lagrangian of Eq. (2.1) is not invariant under the gauge transformation 


iW ROPY RE) > iY ROA VLRO) + Wr px) y" Wi, r(x) darr), 
(5.2) 


because of the spacetime dependence of a, x. In order for such a local transforma- 
tion to be an invariance of the lagrangian, we need an extended kind of derivative 
D,,, Such that 


D Yr, r(x) > exp(~iær rRx))D YL, r(x) (5.3) 


under the local transformation of Eq. (5.1). The quantity D, is a covariant deriva- 
tive, so called because it responds covariantly, as in Eq. (5.3), to a gauge trans- 
formation. 


Abelian case 


Before proceeding with the construction of a covariant derivative, we broaden the 
context of our discussion. Let © (x) now represent a boson or fermion field of any 
spin and arbitrary mass. We consider transformations 


0 > U(ajO (5.4) 
D,@ —> U(e)D, ©, (5.5) 


with a spacetime-dependent parameter, œ = æ (x). Suppose these gauge transfor- 
mations form an abelian group, e.g., as do the set of phase transformations of 
Eq. (5.1). It is sufficient to consider transformations with just one parameter as 
in Eqs. (5.4)-(5.5) since we can use direct products of these to construct arbitrary 
abelian groups. 

One can obtain a covariant derivative by introducing a vector field A,, (x), called 
a gauge field, by means of the relation 


D,© = (a +ifA,)9, (5.6) 


where f is a real-valued coupling constant whose numerical magnitude depends in 
part on the field ©. For example, in electrodynamics f becomes the electric charge 
of ©. The problem is then to determine how A, must transform under a gauge 
transformation in order to give Eq. (5.5). This can be done by inspection, and we 
find 


Ay > A+ TUO . U~! (æ). (5.7) 


8 An abelian group is one whose elements commute. A nonabelian group is one which is not abelian. 
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The gauge field A,, must itself have a kinetic contribution to the lagrangian. This 
is written in terms of a field strength, F», which is antisymmetric in its indices. 
A general method for constructing such an antisymmetric second rank tensor is to 
use the commutator of covariant derivatives, 


Di, D] O = if Faw. (5.8) 
By direct substitution we find 
Fuy = 0pAy — Ag. (5.9) 


It follows from Eq. (5.7) and Eq. (5.9) that the field strength F,» is invariant under 
gauge transformations. A gauge-invariant lagrangian containing a complex scalar 
field g and a spin one-half field y, chiral or otherwise, has the form 


1 — 
p= -g Ew F" + (D'o) D o + iv DY +-->, (5.10) 


where the ellipses stand for possible mass terms and nongauge field interactions. 
There is no contribution corresponding to a gauge-boson mass. Such a term would 
be proportional to A“ A,,, which is not invariant under the gauge transformation, 
Eq. (5.7). 


Nonabelian case 


The above reasoning can be generalized to nonabelian groups [YaM 54]. First, we 
need a nonabelian group of gauge transformations and a set of fields which forms a 
representation of the gauge group. Then, we must construct an appropriate covari- 
ant derivative to act on the fields. This step involves introducing a set of gauge 
bosons and specifying their behavior under the gauge transformations. Finally, the 
gauge field strength is obtained from the commutator of covariant derivatives, at 
which point we can write down a gauge-invariant lagrangian. 

Consider fields © = {0;} (i =1, ..., 7), which form an r-dimensional represen- 
tation of a nonabelian gauge group G. The ©; can be boson or fermion fields of 
any spin. In the following it will be helpful to think of © as an r-component col- 
umn vector, and operations acting on © as r x r matrices. We take group G to 
have a Lie algebra of dimension n, so that the numbers of group generators, group 
parameters, gauge fields, and components of the gauge field strength are each n. 
We write the spacetime-dependent group parameters as the n-dimensional vector 
a ={a,(x)} (a=1,...,n). A gauge transformation on @ is 


0’ = U@)9, (5.11) 
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where the r x r matrix U is an element of group G. For those elements of G which 
are connected continuously to the identity operator, we can write 


U(@) = exp(—ia’G"), (5.12) 


where G = {Ga} (a=1,...,n) are the generators of the group G expressed as 
hermitian r x r matrices. The set of generators obeys the Lie algebra 


[G*,G’] =ict”"G° (a,b,c = 1,...,n), (5.13) 
where {c°} are the structure constants of the algebra. We construct the covariant 
derivative D,,© in terms of gauge fields B, = {Bi} (a=1,...,n) as 

DLO = (Ið, + igB,)0, (5.14) 


where g is a coupling constant analogous to f in Eq. (5.6). In Eq. (5.14), I is the 
r xr unit matrix, and 


B, = G° B}. (5.15) 

Realizing that the covariant derivative must transform as 
(D,,0)' = UG ,9), (5.16) 
we infer from Eqs. (5.12)—(5.14) the response, in matrix form, of the gauge fields, 


B’, = U(@)B,U '@) + ~,U@) -U'(@). (5.17) 


The field strength matrix F „, is found, as before, from the commutator of covariant 
derivatives, 


[D,, D, JO = igF,,,9, (5.18) 
implying 
Fav = 0,B, — 0,B, + ig[B,, B,]. (5.19) 
Eqs. (5.17) and (5.19) provide the field strength transformation property, 
F, = U@F,,,U '(@). (5.20) 


Unlike its abelian counterpart, the nonabelian field strength is not gauge invariant. 
Finally, we write down the gauge-invariant lagrangian 


1 =, 
L= ST PYF») + D“O)D,o + VD +., Gan 


where ® and W are distinct multiplets of scalar and spin one-half fields and the 
ellipses represent possible mass terms and nongauge interactions. Analogously to 
the abelian case, there is no gauge-boson mass term. 
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The most convenient approach for demonstrating the theory’s formal gauge struc- 
ture is the matrix notation. However, in specific calculations it is sometimes more 
convenient to work with individual fields. To cast the matrix equations into compo- 
nent form, we employ a normalization of group generators consistent with 
Eq. (5.21), 


1 
Tr(G‘G’) = 8” (a,b =1,...,n). (5.22) 
To obtain the a component of the field strength Ps ={F avl (a=1,...,n), we 
matrix multiply Eq. (5.19) from the left by G“ and take the trace to find 
Fi = ðu B5 — 3y BS = ge" RB,  (a,b,c=1,...,n). (5.23) 


The lagrangian Eq. (5.21) can likewise be rewritten in component form, 


1 oee 
L=- 7 FF, + (Din Pm)! (Din Gn + iW: D)ijiwj +--+, (5.24) 


where a=1,...,n and the remaining indices cover the dimensionalities of their 
respective multiplets. 


Mixed case 


In the Standard Model, it is a combination of abelian and nonabelian gauge groups 
which actually occurs. To deal with this circumstance, let us consider one abelian 
gauge group G and one nonabelian gauge group Q’ having gauge fields A” and 
Be= {B“} (a=1,...,n), respectively. Further assume that G and G’ commute 
and that components of the generic matter field © transform as an r-dimensional 
multiplet under G’. The key construction involves the generalized covariant deriva- 
tive, written as anr x r matrix, 


D,0 = (O, +ifA, I+ igB,, - G) Ə, (5.25) 


where I is the unit matrix and f, g are distinct real-valued constants. Given this, 
much of the rest of the previous analysis goes through unchanged. The field 
strengths associated with the abelian and nonabelian gauge fields have the forms 
given earlier. So does the gauge-invariant lagrangian, except now the extended 
covariant derivative of Eq. (5.25) appears, and both the abelian and nonabelian 
field strengths must be included. For the theory with distinct multiplets of complex 
scalar fields ® and spin one-half fields Y, the general form of the gauge-invariant 
lagrangian is 
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1 1 = = 
f= =- in EL (FF) +i VDY — Ymy 
+ (D“6)'D,,. ® — V (181) + L(Y, ®), (5.26) 


where m is the fermion mass matrix, V(|®|*) contains the ® mass matrix and any 
polynomial self-interaction terms, and L(Y, ®) describes the coupling between 
the spin one-half and spin zero fields. 


I-6 On the fate of symmetries 


Depending on the dynamics of the theory, a given symmetry of the lagrangian can 
be manifested physically in a variety of ways. Apparently all such realizations are 
utilized by Nature. Here we list the various possibilities. 


(1) The symmetry may remain exact. The electromagnetic gauge U(1) symmetry, 
the SU(3) color symmetry of QCD, and the global ‘baryon-number minus 
lepton-number’ (B — L) symmetry are examples in this class. 

(2) The apparent symmetry may have an anomaly. In this case it is not really a true 
symmetry. Within the Standard Model the global axial U(1) symmetry is thus 
affected. Our discussion of anomalies is given in Sect. I-3. 

(3) The symmetry may be explicitly broken by terms (perhaps small) in the 
lagrangian which are not invariant under the symmetry. Isospin symmetry, bro- 
ken by electromagnetism and light-quark mass difference, is an example. 

(4) The symmetry may be ‘hidden’ in the sense that it is an invariance of the 
lagrangian but not of the ground state, and thus one does not ‘see’ the sym- 
metry in the spectrum of physical states. This can be produced by different 
physical mechanisms. 

(a) The acquiring of vacuum expectation values by one or more scalar fields in 
the theory gives rise to a spontaneously broken symmetry, as in the breaking 
of SU (2), invariance by Higgs fields in the electroweak interactions. 

(b) Even in the absence of scalar fields, quantum effects can lead to the dynam- 
ical breaking of a symmetry. Such is the fate of chiral SU(2); x SU(2)rR 
symmetry in the strong interactions. 


The various forms of symmetry breaking in the above are quite different. In partic- 
ular, the reader should be warned that the word ‘broken’ is used with very different 
meanings in case (3) and the cases in (4). The meaning in (3) is literal — what would 
have been a symmetry in the absence of the offending terms in the lagrangian is 
not a symmetry of the lagrangian (nor of the physical world). Although the usage 
in (4) is quite common, it is really a malapropism because the symmetry is not 
actually broken. Rather, it is realized in a special way, one which turns out to have 
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important consequences for a number of physical processes. The situation is some- 
what subtle and requires more explanation, so we shall describe its presence in a 
magnetic system and in the sigma model. 


Hidden symmetry 


The phenomenon of hidden symmetry occurs when the ground state of the theory 
does not have the full symmetry of the lagrangian. Let Q be a symmetry charge as 
inferred from Noether’s theorem, and consider a global symmetry transformation 
of the vacuum state 


|0) > e'*°|0), (6.1) 
where œ is a continuous parameter. Invariance of the vacuum, 
e210) =|0) ~— (alla), (6.2a) 
implies that 
Q|0) = 0. (6.2b) 


In this circumstance, the vacuum is unique and the symmetry manifests itself in the 
‘normal’ fashion of mass degeneracies and coupling constant identities. Such is the 
case for the isospin symmetric model of nucleons and pions discussed in Sect. I-4, 
where the lagrangian of Eq. (4.6) implies the relations 


Mn = Mp, Mz+ =M,0 = My-, 


(6.3) 
g(ppm°) = —g(nnx°) = g(pnat)/V/2 = g(npa~)/V2, 


with m* = (7 F im) //2. 
Alternatively, if new states |æ) ~ |0) are reached via the transformations of 
Eq. (6.1), we must have 


Q|0) #0. (6.4) 
Since, by Noether’s theorem, the symmetry charge is time-independent, 
Ò = i[H, 0] =0, (6.5) 


all of the new states |œ) must have the same energy as |0). That is, if Eo is the 
energy of the vacuum state, H|O) = Eo|0), then we have 


Hla) = He'*2|0) = e'“2H|0) = Eola). (6.6) 


Because the symmetry transformation is continuous, there must occur a continuous 
family of degenerate states. 
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Can one visualize these new states in a physical setting? It is helpful to refer to a 
ferromagnet, which consists of separate domains of aligned spins. Let us focus on 
one such domain in its ground state. It is invariant only under rotations about the 
direction of spin alignment, and hence does not share the full rotational invariance 
of the hamiltonian. In this context, the degenerate states mentioned above are just 
the different possible orientations available to the lattice spins in a domain. Since 
space is rotationally invariant, there is no preferred direction along which a domain 
must be oriented. By performing rotations, one transfers from one orientation to 
another, each having the same energy. 

Let us try to interpret, from the point of view of quantum field theory, the states 
which are obtained from the vacuum by a continuous symmetry transformation and 
which share the energy of the vacuum state. In a quantum field theory any excita- 
tion about the ground state becomes quantized and is interpreted as a particle. The 
minimum excitation energy is the particle’s mass. Thus the zero-energy excitations 
generated from symmetry transformations must be described by massless particles 
whose quantum numbers can be taken as those of the symmetry charge(s). Thus 
we are led to Goldstone’s theorem [Go 61, GoSW 62] — if a theory has a contin- 
uous symmetry of the lagrangian, which is not a symmetry of the vacuum, there 
must exist one or more massless bosons (Goldstone bosons). That is, spontaneous 
or dynamical breaking of a continuous symmetry will entail massless particles in 
the spectrum. 

This phenomenon can be seen in the magnet analogy, where the excitation is a 
spin-wave quantum. When the wavelength becomes very large, the spin configura- 
tion begins to resemble a uniform rotation of all the spins. This is one of the other 
possible domain alignments discussed above, and to reach it does not cost any 
energy. Thus, in the limit of infinite wavelength (A — oo), the excitation energy 
vanishes (E — 0), yielding a Goldstone boson.” 


Spontaneous symmetry breaking in the sigma model 


We proceed to a more quantitative analysis of hidden symmetry by returning to 
the sigma model of Sect. I-4. Let us begin by inferring from the sigma model 
lagrangian of Eq. (4.14) the potential energy 


2 


V (0o, m) = = (o? + z’) + ; (o? + r?) ; (6.7) 


9 In the ferromagnet case, the spin waves actually have E œ p? ~ 272 for low momentum. In 
Lorentz-invariant theories, the form Æ œ |p| is the only possible behavior for massless single particle states. 
For a more complete discussion, see [An 84]. 
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With u? negative, minimization of V (ø, mx) occurs for the unique configuration 
o =x =0. Hidden symmetry occurs for u? positive, where minimization of 
V (o, m ) reveals the set of degenerate ground states to be those with 


2 
otre = (6.8) 


Let us study the particular ground state, 


2 
TR, — =o, wiped, (6.9) 


Other choices yield the same physics, but require a relabeling of the fields. For this 
case, field fluctuations in the pionic direction do not require any energy, so that the 
pions are the Goldstone bosons. Defining 


õ =0 — v, (6.10) 


we then have for the full sigma model lagrangian 
- 1 1 
L= y (iĝ — gv) Y + 5 [3 ő" — 2767] + 5 ui On 
= Xr 
-gp 6 -it mys) W -vð (5? +°) — 5 [e gaf = v*] , 6.11) 


Observe that the pion is massless, while the o and nucleon fields are massive. Thus, 
at least part of the original symmetry in the sigma model lagrangian of Eq. (4.14) 
appears to have been lost. Certainly, the mass degeneracy m, =m, is no longer 
present, although the normal pattern of isospin invariance survives. However, the 
full set of original symmetry currents remain conserved. In particular, the axial 
current of Eq. (4.23), which now appears as 

; z ti : . ; 

A, = WYuysz Y — vôu T + T'I — TIT, (6.12) 
still has a vanishing divergence, ð" Al, = 0. We warn the reader that to demonstrate 
this involves a complicated set of cancelations. 

For a normal symmetry, particles fall into mass-degenerate multiplets and have 
couplings which are related by the symmetry. The isospin relations in Eq. (6.3) 
are an example of this. In a certain sense, a hidden symmetry likewise gives rise 
to degenerate states whose couplings are related by the symmetry. The degeneracy 
consists of a state taken alone or accompanied by an arbitrary number of Goldstone 
bosons. For example, in the sigma model it can be a nucleon and the same nucleon 
accompanied by a zero-energy massless pion, which are degenerate. Moreover, 
the couplings of such configurations are restricted by the symmetry. Historically, 
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predictions of chiral symmetry were originally formulated in terms of soft-pion 
theorems (cf. App. B-3) relating the couplings of the N states to those of the degen- 
erate x N states. 
lim (*(q)N'|O|N) = = ANG OJIN), (6.13) 
q4u>0 Fy 
where O is some local operator and N, N’ are nucleons or other states. This cap- 
tures intuitively the nature of symmetry predictions for a hidden symmetry. In this 
book, we will explore such chiral relations using the more modern techniques of 
effective lagrangians. 

To summarize, if a symmetry of the theory exists but is not apparent in the single- 
particle spectrum, it still can have a great deal of importance in restricting parti- 
cle behavior. What happens is actually quite remarkable — in essence, symmetry 
becomes dynamics. One obtains information about the excitation or annihilation 
of particles from symmetry considerations. In this regard, hidden symmetries are 
neither less ‘real’ nor less useful than normal symmetries — they simply yield a 
different pattern of predictions. 


Problems 


(1) The Poincaré algebra 

(a) Consider the spacetime (Poincaré) transformations, x* — Ax" + a", 
where A’ A7” =g”. Associated with each coordinate transformation 
(a, A) is the unitary operator U (a, A) = exp(ia, P” — bep M””). For two 
consecutive Poincaré transformations there is a closure property, U (a', A’) 
U (a, A)=U(...). Fill in the dots. 

(b) Prove that U(a™!, 0)U (a', O)U (a, 0) =U (a', 0), and by taking a’,, a, 
infinitesimal, determine [P#, P”]. 

(c) Demonstrate that (A7 !)}» = A,,, and then show that 
U (0, A“!)U(a’, A)NU (0, A) =U (A7 ta', ATHA’ A). 

(d) For infinitesimal transformations we write A", ~ g^ + e4. Prove that 
€o} = —€,¢ and hence M,, = — M,,. Upon taking primed quantities in (c) 
to be infinitesimal, prove U (0, A~') P“U(0, A) = A# P” and U(O, A7!) 
MU (0, A) = A, A" M “8 Finally, letting unprimed quantities be infini- 
tesimal as well, determine [M°f, P“] and [M°®, M“”]. 

(2) The Meissner effect in gauge theory [Sh 81] 
The lagrangian for the electrodynamics of a charged scalar field is 


1 
Lo = —qFuwk™ + (Dug) (D" 9g) — V@) 


with covariant derivative D, = 0, +ieA, and potential energy, 


(a) 
(b) 


(c) 


(d) 


(e) 


Problems 2) 


m? * À K y2 

VO T Ptr (A > 0). 
Identify the electromagnetic current of the ¢ field. 
For m? > 0, show that the ground state is p =0, A, =O. In this case, the 
theory is that of normal electrodynamics. 
For m? < 0 (m? > —,? with u? > 0), we enter a different phase of the 
system. Show that the ground state is now gy = const. = v, A, =0. What 
is the photon mass in this phase? Calculate the potential between two static 
point charges each of value Q. What sets the scale of the screening length? 
Let us now add an external field to the system, 


1 v 
Lo => Lo + z Fu Fen š 


To see that Fh indeed acts like an applied field, show that if one disregards 
the field g the equations of motion require F“” = Fey. 

Demonstrate that there are two simple solutions to the equations of motion 
in the presence of a constant applied field, 


[o P=), 
P= ly (FY =0). 


Again, these correspond to unscreened and screened phases of the electro- 
magnetic field. 

Calculate the energy of the two phases if F4; describes a constant mag- 
netic field. Show that the phase in part (e) with g = 0 has the lower energy 
for B > Beritica Whereas for B < Beriticai it is the phase with g = v which 
has the lower energy. Discuss the similarity of this result to the Meissner 
effect. 
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Interactions of the Standard Model 


A gauge theory involves two kinds of particles, those which carry ‘charge’ and 
those which ‘mediate’ interactions between currents by coupling directly to charge. 
In the former class are the fundamental fermions and nonabelian gauge bosons, 
whereas the latter consists solely of gauge bosons, both abelian and nonabelian. 
The physical nature of charge depends on the specific theory. Three such kinds of 
charge, called color, weak isospin, and weak hypercharge, appear in the Standard 
Model. The values of these charges are not predicted from the gauge symmetry, but 
must rather be determined experimentally for each particle. The strength of cou- 
pling between a gauge boson and a particle is determined by the particle’s charge, 
e.g., the electron—photon coupling constant is —e, whereas the u-quark and pho- 
ton couple with strength 2e/3. Because nonabelian gauge bosons are both charge 
carriers and mediators, they undergo self-interactions. These produce substantial 
nonlinearities and make the solution of nonabelian gauge theories a formidable 
mathematical problem. Gauge symmetry does not generally determine particle 
masses. Although gauge-boson mass would seem to be at odds with the principle of 
gauge symmetry, the Weinberg—Salam model contains a dynamical procedure, the 
Higgs mechanism, for generating mass for both gauge bosons and fermions alike. 


H-1 Quantum Electrodynamics 


Historically, the first of the gauge field theories was electrodynamics. Its modern 
version, Quantum Electrodynamics (QED), is the most thoroughly verified phys- 
ical theory yet constructed. QED represents the best introduction to the Standard 
Model, which both incorporates and extends it. 


U(1) gauge symmetry 


Consider a spin one-half, positively charged fermion represented by field y. The 
classical lagrangian which describes its electromagnetic properties is 


28 
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len =] F? +y (ip—m) wv. (1.1) 


Here, the covariant derivative is D, w = (0, +ieA,)y, m and e are, respectively, 
the mass and electric charge for y, A,, is the gauge field for electromagnetism, F”” 
is the gauge-invariant field strength (cf. Eqs. I-5.8), (I-5.9)), and F? = F PE P is 
This lagrangian is invariant under the local U (1) transformations 


pa) > eM yr), (1.2) 
A (x) > Ay(x) + e7',00(x). (1.3) 


The associated equations of motion are the Dirac equation 
@p—m—eA)w=0, (1.4) 


and the Maxwell equation 
3 F” =ewy’w. (1.5) 


It is worthwhile to consider in more detail the important subject of U (1) gauge 
invariance, addressing both its extent and its limitations. 


(i) Universality of electric charge: The deflection of atomic and molecular beams 
by electric fields establishes that the fractional difference in the magnitude of elec- 
tron and proton charge is no larger than O(10~7°). Likewise, there is no evidence 
of any difference between the electric charges of the leptons e, u, t. Whatever the 
source of this charge universality may be, it is not the U(1) invariance of electro- 
dynamics. For example assume that in addition to w, there exists a second charged 
fermion field y’ with charge parameter £e. It is easy to see that gauge invariance 
alone does not imply 8 = 1. The electromagnetic lagrangian for the extended 
system is 


1 = = 
Lom ==7 +Y GP- m) y +Y (iP =m) W, (1.6) 


where D,Y’ = (0, + iBeA,(x))y’. The above lagrangian is invariant under the 
extended set of gauge transformations 


w(x) > eya), yr (x) > oP w' (x), 
A (x) > Ay(x) + e™l3 a(x). (1.7) 


This demonstration of gauge invariance is valid for arbitrary 6, and thus says noth- 
ing about its value. The U (1) symmetry is compatible with, but does not explain, 
the observed equality between the magnitudes of the electron and proton charges. 
We shall return to the issue of charge quantization in Sect. I—3 when we consider 
how weak hypercharge is assigned in the Weinberg—Salam model. 
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(ii) A candidate quantum lagrangian: The quantum version of Lem is in fact the 
most general Lorentz-invariant, hermitian, and renormalizable lagrangian which is 
U (1) invariant. Consider the seemingly more general structure 


1 = _ = E 
Leen = ager +iZrYr Dyr +iZLYL PYL — Myry — Mvp vr, (18) 


where Z, Zr, are constants, Į is the covariant derivative of Eq. (1.1), and M 
can be complex-valued. This lagrangian not only apparently differs from Lem, 
but seemingly is CP-violating due to the complex mass term. However, under the 
rescalings 


A= Ais, e = Ze, VEL = ZRLÝR L (1.9) 
we obtain 
1 — —/ ET 
Leen = “GF? +ivD Ww! — MV pw! — MU, (1.10) 


where M’ = (ZprZ,)~'/?M. A subsequent global chiral change of variable 
Wig =e p] R (æ = constant) (1.11) 


does not affect the covariant derivative term but modifies the mass terms, 


” | ’ ay N 1 dias") 1 2iay* yy 
= = r +iy Dy" -Meppy — (Meyy aI) 
Choosing the parameter œ so that Im (M'e”%) = 0 and defining m = Re(M'e”®), 


we see that L”. reduces to Lem which appears in Eq. (1.1). 


gen 
(iii) Renormalizability and U (1): Renormalizability plays a role in the preceding 
discussion because U (1) symmetry by itself would admit a larger set of interaction 
terms. In principle, U(1) invariant terms like Wort Fuy, Wy FH” Fws wyty” 
y“y’ y Fu Fog, etc. could appear in the QED lagrangian. However, they do not 
because the condition of renormalizability admits only those contributions which 
have dimension d < 4. As discussed in App. C-3, the canonical dimension of 
boson and fermion fields is d = 1, 3/2 respectively, and each derivative adds a unit 
of dimension. Accordingly, the above candidate operators have d = 5, 7, 7 and 
thus are ruled out. There remains an operator, FF H», which is gauge-invariant 
and has dimension 4.! A noteworthy aspect of this quantity is that, unlike the other 
operators encountered thus far, it is odd under CP. This follows from writing it as 
—4E - B and realizing that under CP, E —> E and B —> —B. However, a simple 
exercise shows that we can identify this operator as a four-divergence F ae ae 
ð K”, where KY = DehOP A 3x A g- Thus, a contribution proportional to F, or id 


1 We define the tensor FY which is dual to F“” as FEY = eh Fop/2. 
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can be of no physical consequence. Upon integration over spacetime, it becomes a 
surface term evaluated at infinity. There is nothing in the structure of QED which 
would cause such a surface term to be anything but zero. 


QED to one loop 


The perturbative expansion of QED is carried out about the free field limit, and is 
interpreted in terms of Feynman diagrams. Two distinct phenomena are involved, 
scattering and renormalization. The latter encompasses both an additive mass shift 
for the fermion (but not for the photon) and rescalings of the charge parameter 
and of the quantum fields. To carry out the calculational program requires a quan- 
tum lagrangian Lorp to establish the Feynman rules, a regularization procedure to 
interpret divergent loop integrals, and a renormalization scheme. 

One can develop QED using either canonical or path-integral methods. In either 
case a proper treatment necessitates modification of the classical lagrangian. As 
we have seen, the U(1) gauge symmetry implies a certain freedom in defining the 
A" (x) field. Regardless of the quantization procedure adopted, this freedom can 
cause problems. For canonical quantization, the procedure of selecting a complete 
set of coordinates and their conjugate momenta is upset by the freedom to gauge 
transform away a coordinate at any given time. For path integration, the integration 
over gauge copies of specific field configurations gives rise to specious divergences 
(cf. App. A—6). In either case, superfluous gauge degrees of freedom can be elim- 
inated by introducing an auxiliary condition which constrains the gauge freedom. 
There are a variety of ways to accomplish this. The one adopted here is to employ 
the following gauge-fixed lagrangian, 


1 1 E 
LOED = = = 5 Ay ae (if — eo — mo) W, (1.13) 
50 


where ep and mọ are, respectively, the fermion charge and mass parameters. The 
quantity éo is a real-valued, arbitrary constant appearing in the gauge-fixing term. 
This term is Lorentz-invariant but not U (1)-invariant. One of its effects is to make 
the photon propagator explicitly dependent on £o. The value &) = 1 corresponds to 
Feynman gauge, whereas the limit &) — 0 defines the Landau gauge. 

The zero subscripts on the mass, charge, and gauge-fixing parameters denote 
that these bare quantities will be subject to renormalizations, as will the quantum 
fields. This process is characterized in terms of quantities Z; and ôm, 

y = zZ} y", AZ A 
eo = ZZ Z e, mo = m — ôm, 
o = 236, (1.14) 
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where the superscript ‘r’ labels renormalized fields. The renormalization constants 
Z, Z2, and Z3 (associated respectively with the fermion—photon vertex, the fermion 
wavefunction, and the photon wavefunction) and the fermion mass shift ôm are 
chosen order by order to cancel the divergences occurring in loop integrals. For 
vanishing bare charge ey = 0, they reduce to Z1,2,3 = 1, ôm = 0. 


The Feynman rules for QED are: 


fermion—photon vertex: 


pene ieee 


=i eo (Yap 
(1.15) 
fermion propagator i Sag (p): 
l p 
i (Ø + Modes p — « 
r= mo + ie 
(1.16) 
photon propagator i D™ (q): 
; q 
L — ge” J a _ £o) q"q v n u 
q? tie q?’ +ie 
(1.17) 


In the above € is an infinitesimal positive number. 

The remainder of this section is devoted to a discussion of the one-loop radiative 
correction experienced by the photon propagator.” Throughout, we shall work in 
Feynman gauge. 

Let us define a proper or one-particle irreducible (1 PI) Feynman graph such 
that there is no point at which only a single internal line separates one part of 
the diagram from another part. The proper contributions to photon and to fermion 
propagators are called self-energies. The point of finding the photon self-energy 
is that the full propagator i D/,,, can be constructed via iteration as in Fig. II-1. 
Performing a summation over self-energies, we obtain 


~~ = nnn H ~S + ~~r +. 
Fig. II—1 The full photon propagator as an iteration. 


2 We shall leave calculation of the fermion self-energy to Prob. II-3 and analysis of the photon-fermion vertex 
to Sect. V-1. 
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p-q Pa 
(a) (b) (c) 


Fig. Il-2 One-loop corrections to (a) photon propagator, (b) fermion-photon 
vertex, and (c) fermion propagator. 


iD!, =iDyy +iDyaGT) iDgy +++ 


—i 1 at) se] 
z| [er= pega, 1.18 
q? f +I (q) (ex q? ai q? — 


where the proper contribution 


iT”? (gq) = (qq? — q'il) (1.19) 


is called the vacuum polarization tensor. It is depicted in Fig. H—2(a) (along 
with corrections to the photon-fermion vertex and fermion propagator in 
Figs. II—2(b)-(c)), and is given to lowest order by 


4 r Z 
ilI (q) = —(-ieoy? | oe Tr | me ; B l IF 


Or) |Y pom+ie’ p—gG—mtie 
(1.20) 


This integral is quadratically divergent due to singular high-momentum behavior. 
To interpret it and other divergent integrals, we shall employ the method of dimen- 
sional regularization [BoG 72, ’tHV 72, Le 75]. 

Accordingly, we consider I1%’(q) as the four-dimensional limit of a function 
defined in d spacetime dimensions. Various mathematical operations, such as 
summing over Lorentz indices or evaluating loop integrals, are carried out in d 
dimensions and the results are continued back to d = 4, generally expressed as an 
expansion in the variable? 


es- (1.21a) 


Formulae relevant to this procedure are collected in App. C-5. For all theories 
described in this book, we shall define the process of dimensional regularization 
such that all parameters of the theory (such as e°) retain the dimensionality they 


3 We shall follow standard convention is using the symbol e for both the infinitesimal employed in Feynman 
integrals and the variable for continuation away from the dimension of physical spacetime. 
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have for d = 4. In order to maintain correct units while dimensionally regularizing 
Feynman integrals, we modify the integration measure over momentum to 
d*p x f ap 
>u ; 
(27)* (27)! 


(1.21b) 


The parameter ju is an arbitrary auxiliary quantity having the dimension of a mass. 
It appears in the intermediate parts of a calculation, but cannot ultimately influ- 
ence relations between physical observables. Indeed, there exist in the literature a 
number of variations of the extension to d 4 4 dimensions. These are able to yield 
consistent results because one is ultimately interested in only the physical limit 
of d = 4. Let us now return to the photon self-energy calculation to see how the 
dimensional regularization is implemented. 
The self-energy of Eq. (1.20), now expressed as an integral in d dimensions, is 


d'p p%(p—q)? + p?(p—q)* + 8% (m* — p- (p—q)) 


me =d 2 2e 
=tio I ny [p? — m? + iell(p — q) — m? + ie] 


’ 


(1.22) 


where we retain the same notation I1°4(q) as for d = 4 and we have already com- 
puted the trace. Upon introducing the Feynman parameterization, Dirac relations, 
and integral identities of App. C-5, we can perform the integration over momentum 
to obtain 


2 1 
aß _ 70,8 _ 2 ap, £0 P(e) a) x(1 — x) 
i aC ileal ie oe, Gm" ff ae Pee (1.23) 


We next expand T°% (q) in powers of e and then pass to the limit € —> 0 of physical 
spacetime. In doing so, we use the familiar 


€ Ina‘ 


a =e = e ~] +elnat+-:-, (1.24) 


and take note of the combination 


M So eee 2OG: (1.25) 
(470 )—€ € 


—! makes it nec- 


where y = 0.57221... is the Euler constant. The presence of € 
essary to expand all the other €-dependent factors in Eq. (1.23) and to take care in 
collecting quantities to a given order of €. To order e, the vacuum polarization in 


Feynman gauge is then found to be 
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271 
M(q) = £ [etn —y 


1272 
1 2_g?x(1— 
-6 Í dxx0-om(” = 2) +00] 
0 H 
1 Sa =a 2 2 
a |otin@x)-y+3-mm— p+. lq’ > m), 
_ £ € Mr (1.26) 
127x? | 1 m q 2 2 
-= +ln(4r)- y -n — +z to m > lgl). 
€ u 5m 


The above expression is an example of the general property in dimensional reg- 
ularization that divergences from loop integrals take the form of poles in e. These 
poles are absorbed by judiciously choosing the renormalization constants. Renor- 
malization constants can also have finite parts whose specification depends on 
the particular renormalization scheme employed. One generally adopts a scheme 
which is tailored to facilitate comparison of theory with some set of physical ampli- 
tudes. In the minimal subtraction (MS) renormalization, the Z; subtract off only the 
€-poles, and thus have the very simple form, 


(oe) 


Zp al = 5 w sian (1.27) 
e” 
n=1 


Because the [70 — 1} have no finite parts, they are sensitive only to the ultravio- 
let behavior of the loop integrals, and the c; „n are independent of mass. The simple 
appearance of the MS scheme is somewhat deceptive since further (finite) renor- 
malizations are required if the mass and coupling parameters of the theory are to be 
asociated with physical masses and couplings. A related renormalization scheme 
is the modified minimal subtraction (MS) in which renormalization constants are 
chosen to subtract off not only the €-poles but also the omnipresent term 1n (47 )— y 
of Eq. (1.25). Minimal subtraction schemes are typically used in QCD where, due 
to the confinement phenomenon (cf. Sect. II-2), there is no renormalization scale 
that could naturally be associated with the mass of a freely propagating quark. 
Yet another approach is the on-shell (o-s) renormalization, where the renormalized 
mass and coupling parameters of the theory are arranged to coincide with their 
physical counterparts. 


On-shell renormalization of the electric charge 


The renormalization scale for electric charge is set by experimental determina- 
tions typically involving solid-state devices like Josephson junctions. These refer 
to probes of the electromagnetic vertex —eI,,(p2, pı) of Fig. H-2(b) with on-shell 
electrons (ps = p? = mê) and with q? = (pı — p2) ~ 0. The value of the 
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electromagnetic fine-structure constant œ = e? /47 obtained under such conditions 
is given in rationalized units by 


a! = 137.035999074(44). (1.28) 


To interpret this in the context of the theoretical analysis performed thus far, 
recall from Eq. (1.18) how the photon propagator is modified by radiative 
corrections, 

i e? 
q? 1+ N@) 


We display only the g,,, piece since, in view of current conservation, only it can 


i 
ie? Digg = -7 — e Suv > ie’ Di, Piu (1.29) 


contribute to the full amplitude upon coupling the propagator to electromagnetic 
vertices. The above suggests that we associate the physical, renormalized charge e 


with the bare charge parameter eg by 
2 
2 ep 


~ 147100) 


In this on-shell renormalization prescription, the g,,, part of the photon propagator 


~ eS[1 — TI (0)]. (1.30) 


D, (q) is seen to assume its unrenormalized form in the physical limit q? > 0. 
The appellation ‘on-shell’ means that the physical kinematic point q? = 0 is 
used to implement the renormalization condition, and, by absorbing the singular 
vacuum polarization in the electric charge, one ensures that the photon has zero 
mass. Likewise, in the on-shell renormalization approach fermion propagators have 
poles at their physical masses. 

Next, we show how to infer the form of the renormalization constant 2n in the 
on-shell scheme. There is a relation, called the Ward identity, that implies Z; = Z2 
as a consequence of the gauge symmetry of the theory. From Eq. (1.14), this gives 


e = Zo ep. (1.31) 


Use of the relation e? = Z°e? then specifies the on-shell renormalization con- 
stant to be 
z e fi m? 
ZO) =1- E + In(4r) — y —In (= z) 4 oo|: (1.32) 
í 127? | € u? 


One can similarly absorb the €-pole in either the MS or MS schemes by adopting 


2 


(MS) __ e 
Z; =1- wae + OCC, 
2 


~ 127? 


7 Ms) = 


(G — y + ln(4r) ) + O(e’). (1.33) 
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Fig. II-3 Virtual pair production in the vicinity of a charge. 


Eqs. (1.32), (1.33) display how the various renormalization constants differ by 
finite amounts. The €-poles in the fermion self-energy and the fermion-photon ver- 
tex can be dealt with in the same manner and we find, e.g., in MS renormalization 
(cf. Prob. II-3 and Sect. V—1), 


2 
MS) _ >(MS) _ e l 4 
Zi = z = 1- r tO, (1.34) 
3e? 1 
sm = Ien? m = + O(e). (1.35) 


Electric charge as a running coupling constant 


The concept of electric charge as a ‘running’ coupling constant is motivated by the 
following consideration. In the perturbative Feynman expansion for a given theory, 
the hope is that corrections to the lowest-order amplitudes will be small. However, 
potentially large corrections of the form In q?/q can arise if the theory is renor- 
malized at scale gj but then applied at a very different scale q?. It is convenient 
to deal with this problem by absorbing such logarithms into scale-dependent or 
‘running’ renormalized coupling constants and masses. 

To see why scale-dependent charge is not an unreasonable concept, consider the 
vacuum polarization process of Fig. I—3, which depicts virtual production of a 
fermion of charge Q;e together with its antiparticle near a charge source. Due to 
the source, each such vacuum fluctuation is polarized, and thus the source becomes 
screened. All charged fermion species contribute to the screening, and the larger 
the mass of the virtual pair, the closer they lie to the source. The effect is somewhat 
akin to concentric onion skins, with each virtual pair forming a layer, resulting in 
an effectively scale-dependent source charge. 

Let us seek a method for specifying a running fine structure constant w(q) for 
nonzero momentum transfers, with w(O) to be identified with the «œ of Eq. (1.28). 
The interpretation of e2/(1 + Re TI(q)) as a running charge is appealing since it 
would maintain the simple —i/gq? structure of the lowest-order photon exchange 
amplitude. The fact that TI (q) is divergent (see Eq. (1.26)) can be circumvented by 
subtracting off its value at q? = 0 to define a finite quantity TI (q) = I (q) — TI (0) 
and defining 
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2 


2 e 2 = 
= — ~ 1 — Re III ; 1.36 
e (q) 1+ Re Tq) 2 | e IT(q)] (1.36) 


so that a(q) = e7(q)/4:. It is not difficult to deduce the behavior of TI (q) from 
the integral representation of Eq. (1.26), and we find 


5 2 

— a [a-m + ing?) +--+ P> md, 

UQ = z K m (1.37) 
2+ m? > q’). 
5m? 


Observe that the arbitrary energy scale jz is absent from TI (q), as would be expected 
since I1(q) is a physically measurable quantity. 

The above formulae correspond to the loop correction of one fermion of mass 
m. Generally, loops from all available fermions must be included, although contri- 
butions of heavy (m? >> q?) fermions are seen to be suppressed. Important modern 
applications of the Standard Model engender phenomena at scales provided by the 
gauge-boson masses My, Mz. To obtain an estimate for (M2), we can apply 
Eq. (1.37) to find 


2 
a—'(M5) = œ~! -45a (mite 5) 4] (1.38) 


If a sum over quark-loops (each being accompanied by the color factor Ne = 3) 
and lepton-loops is performed, then the mass values in Tables I-2, I-3 yield the 
approximate determination œ™! (M3) ~ 130. The main uncertainty in this approach 
arises from quarks. It is possible to perform a more accurate evaluation of a(M3) 
(cf. Sect. XVI-6) which avoids this difficulty. 

Let us return to the question of how to define a momentum-dependent coupling. 
To emphasize the fact that a ‘running fine-structure constant’ is after all a matter 
of definition, let us consider a somewhat different derivation (and definition) of 
a(q7). One is able to renormalize the electric charge in a mass-independent scheme 
[We 73] by calculating renormalization constants with m = 0. If we return to the 
vacuum polarization diagram, but with m = 0, we find 


eo NTI 5 
Ma’ = 55 (4) E +In(4x)—y +3 + oo| 
e0 


= 1272 


2 
E pied =i (=) $ oo| . (1.39) 
€ 3 u? 
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In order to apply the renormalization program, we must specify the value of the 


coupling at some renormalization point, which we choose to be q? = —/1%, 
identifying 
2 2 2 
27:2 eo 2 eg (1 HR 
e = xel- — ln +- J]. 40 
erma ale et) oa 
ZER 


However, if we had chosen a different renormalization point TR we would have 
obtained a different value, 


e u2 
e (uk) =e UR) + — In a (1.41) 
R 


1272 


The functional dependence of the charge on the renormalization scale is embodied 
in the so-called beta function of electrodynamics [GeL 54], 
3 


de e` 
= O(e?). 1.42 
dun 2m? + Ole") (1.42) 


It can be shown [Po 74] that the leading and next-to-leading terms in a perturbative 
expansion of Bggp are independent of both renormalization and gauge choices. 
The quantity e?(j7,) defined by integrating the beta function, 


Boep(é) = UR 


de _ dur 
Boev(e) Hr 
is not exactly the same quantity as the running coupling constant defined in 


Eq. (1.36), differing by a (small) finite renormalization. For example, the electron 
contribution to the running coupling in the range m? < u% < M3 is 


: (1.43) 


2 
—a!(uR)| eZ (1.44) 
PR Ry 3 m2’ i 


which contains the dominant logarithmic dependence, but differs from Eq. (1.38) 
by a small additive term. However, complete calculations of all corrections to phys- 
ical observables using the two schemes will yield the same answer. Since the run- 
ning coupling constant is but a bookkeeping device, one’s choice is a matter of 
taste or of convenience. Regardless of the specific definition employed for a(q7), 
we see that as the energy scale is increased (or as distance is decreased), the run- 
ning electric charge grows. This is anticipated from the screening of a test charge 
due to vacuum polarization (recall our explanation of Fig. II-3). As the momentum 
transfer of a photon probe is increased, the screening is penetrated and the effective 
charge increases. 


aT! (uh) 


2 
HR=me 


4 Note that the renormalization point upg and the scale factor jz in dimensional regularization need not be 
identical. They are sometimes confused in the literature, and hence we use a different notation for the two 
quantities. 
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The use of a mass-independent scheme is convenient for identifying the high- 
energy scaling behavior of gauge theories. One useful feature is in the calcula- 
tion of the one-loop beta function. Dimensional analysis requires that the one-loop 
charge renormalization be of the form, 


2\€ 1 
2 = 20 f — god (4) (= + finite terms) + oc , (1.45) 
q? 


where g is the ‘charge’ associated with the gauge theory being considered. Choos- 
ing the renormalization point as q? = —u4, and forming the beta function as in 
Eq. (1.42), we see that 6 = bg?. This allows the beta function to be simply identi- 
fied with the coefficient of e7! to this order. 
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Chromodynamics, the nonabelian gauge description of the strong interactions, 
contains quarks and gluons instead of electrons and photons as its basic degrees 
of freedom [FrG 72, Co 11]. A hallmark of Quantum Chromodynamics (QCD) 
is asymptotic freedom [GrW 73a,b, Po 73], which reveals that only in the short- 
distance limit can perturbative methods be legitimately employed. The necessity to 
employ approaches alternative to perturbation theory for long-distance processes 
motivates much of the analysis in this book. 


SU(3) gauge symmetry 


Chromodynamics is the SU (3) nonabelian gauge theory of color charge. The 
fermions which carry color charge are the quarks, each with field we, where 
a = u,d,s,... is the flavor label and j = 1, 2,3 is the color index. The gauge 
bosons, which also carry color, are the gluons, each with field A“, a=1,..., 8.5 


Classical chromodynamics is defined by the lagrangian 
1 auv a —(a) . a a 
Leor = =7 F Fa + D Vy CDi — MOS YR, (2.1) 


where the repeated color indices are summed over. The gauge field strength 
tensor is 


Fi = hA =A = et ALA, (2.2a) 


5 In this section, it will be particularly important to explicitly display color indices. We shall reserve indices 
which begin the alphabet for gluon color indices (e.g., a,b,c = 1,..., 8), use mid-alphabetic letters for 
quark color indices (e.g., j, k,l = 1, 2, 3), and employ greek symbols for flavor indices. 
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where g3 is the SU (3) gauge coupling parameter, and the quark covariant derivative 
is 


ha 
D y = (1, + iaga) Y. (2.2b) 


The lagrangian of Eq. (2.1) is invariant under local SU (3) transformations of the 
color degree of freedom, under which the quark and gluon fields transform as given 
earlier in Eqs. (1-5.1 1), (1-5.17). Equations of motion for the quark and gluon fields 
are 


(ip — m® )y@ =0, 


ä —a) Xa a 
DY Fi, = 3) V FAV: (2.3) 


In its quantum version, the g3 — O limit of Leolor describes an exceedingly sim- 
ple world. There exist only free massless spin one gluons and massive spin one- 
half quarks. However, the full theory is quite formidable. In particular, accelerator 
experiments reveal a particle spectrum which bears no resemblance to that of the 
noninteracting theory. 

The group SU (3) has an infinite number of irreducible representations R. The 
first several are R = 1, 3, 3*, 6, 6* 8, 10, 10°, ..., where we label an irreducible 
representation in terms of its dimensionality. Quarks, antiquarks, and gluons are 
assigned to the representations 3, 3*, 8 respectively. We denote the group genera- 
tors for representation R by {F,(R)} (a = 1, ..., 8). The quantities 2/2 are group 
generators for the d = 3 fundamental representation, i.e., F(3) = 4/2. They have 
the matrix representation 


010 0 1 00 0 
à=ļ100 M4 = 0 0 47=10 0 =i 
000 i 60 0 i 0 
0 -i 0 Oo 4 wy 9 0 
w=li 0 0 4s=10 0 0 Ag=] 0 A 0 
—— 
0 0 0 i 0 0 00 5 
1 0 0 000 
433=[0 -1 0 As = (0 0 1]. (2.4) 
0 0 0 01i ð 


As generators, they obey the commutation relations 


[Aa, Ap] = 2i fabchc (a,b,c=1,...,8) (2.5a) 
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Table H-1. Nonvanishing f, d coefficients. 


abc Jate abc dabe abc dabe 
123 1 118 1/73 355 1/2 
147 1/2 146 1/2 366 —1/2 
156 —1/2 157 1/2 377 =1/2 
246 1/2 228 1/73 448 =1/2/3 
257 1/2 247 —1/2 558 —1/2/3 
345 1/2 256 1/2 668 —1/2/3 
367 1/2 338 1/3 778 =17/3 
458 J3/2 344 1/2 888 -1/¥3 
678 /3/2 


where the f-coefficients are totally antisymmetric structure constants of SU (3). 
There exist corresponding anticommutation relations 


4 
{Aq, Av} = 3 bab I+ 2dabche (a,b,c =1,...,8) (2.5b) 


with d-coefficients which are totally symmetric. Values for fabe and dabe are given 
in Table H-1. 
Useful trace relations obeyed by the {À} are 


Tr àa = 0 (a =1,...,8) (2.6) 
from Eq. (2.4) and 
Tr AgApy = 2ôab (a,b=1,...,8) (2.7) 


from Eq. (2.5). The statement of completeness takes the form, 
2 ma 
AFAk = ~ 3919 + 25418 jx (i, J k,l = 1,2, 3); (2.8) 


where a = 1,...,8 is summed over. Useful labels for the irreducible represen- 
tations of SU (3) are provided by the Casimir invariants. For any representation 
R, the quadratic Casimir invariant C2(R) is defined by squaring and summing the 
group generators {F,(R)}, 
8 

C(R)I = X F(R). (2.9) 

a=1 

There is also a third-order Casimir invariant, 

8 

CR = È daveFa(RYF5(R)F(R). (2.10) 


a,b,c=1 
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The quark and antiquark states form the bases for the smallest nontrivial irre- 
ducible representations of SU(3). It is possible to use products of them, say p 
factors of quarks and q factors of antiquarks, to construct all other irreducible ten- 
sors in SU (3). Each irreducible representation R is then characterized by the pair 
(p,q). For example, we have the correspondences 1 ~ (0,0), 3 ~ (1,0), 3* ~ 
(0,1), 8 ~ (1,1), 10 ~ (3,0), etc. The (p,q) labeling scheme provides useful 
expressions for the dimension of a representation, 


d(p,q) = (p+ 1)(q + 1)(p +q + 2)/2, (2.11) 
and of the two Casimir invariants, 


Co(p, q) = Bp +34 + pP’ + pq + q°)/3, 
C3(p, gq) = (p —4)Ē2p +4 +3)(24 + p +3)/18. (2.12) 


From Eq. (2.12) we find C2(3) = C2(3*) = 4/3 for the quark and antiquark 
representations. Equivalently, upon setting j = k and summing in Eq. (2.8) we 
obtain 

ijj 


16 
Xe: 4 = z ou = 4C3(3)6;1. (2.13) 


Generators for the d = 8 regular (or adjoint) representation are determined from 
the structure constants themselves, 


(F° (8))be = —i fave (a,b,c =1,...,8). (2.14) 


It follows directly from Eq. (2.14) and from using Eq. (2.12) to compute C2(8) = 3 
that 


Jaca foca = C2(8) Sap = 3 Sap. (2.15) 


This result, in turn, enables us to determine 
farcdbde = Tei Ae] = i fabe focaha = i C2(8)ha. (2.16) 
As a final example involving SU (3), we evaluate the quantity 
ADAIAD — ; (ara, AP] — pat nope + APAPA + nena? ) 
= 4C)(3)M! + ifarcld’, A°] = 4 (cx — 500) a (2.17) 


Shortly, we shall see how such combinations of color factors arise in various 
radiative corrections. 
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Including only gauge-invariant and renormalizable terms, we can write the most 
general form for a chromodynamic lagrangian as 


1 —a ap. =e 08 za a, 
Leen = gern Fe T YZ iD Wh T VpZq iD Wr —¥ Mm Pwr 


83 
aoe Fe, Fi 


where the flavor matrices Zz g are hermitian, color and flavor indices are as before, 


— We Mi wh + (2.18) 


except that for simplicity we suppress quark color notation. The final contribution 
to Eq. (2.18) is called the 6-term. We can reduce Lgen to the form of Leojor by first 
rescaling, 


A3 = ZAS, g= Zg, (2.19) 
and then diagonalizing and rescaling with respect to quark flavors, 


Wi r = UL, RYL, R, UL RZL RU] p = ALR, LR = = ÄR (2.20) 


where Az g are diagonal. Finally we diagonalize the mass terms 


Ppr M Byth, (2.21) 


—//Q 


L ass = -Vr M"? ve 


where M’ = Ag” "U, M UAR” T by means of yet another set of unitary trans- 


formations on the quark fields. Aside from the 0-term, this results in the canonical 
expression for Leolor of Eq. (2.1). 

We shall demonstrate later in Sect. IX—4 that the above quark mass diagonaliza- 
tion procedure induces a modification in the @-parameter, 


0 > 6 = 9 + arg det M’. (2.22) 


This does not imply 6 = 0 because both 0 and the original quark mass matrices are 
arbitrary from the viewpoint of renormalizability and SU (3) gauge invariance. In 
fact, the 9-term cannot be ruled out by any of the tenets which underlie the Standard 
Model. Moreover, although the 0-term can be expressed as a four-divergence 


g 


i 
KH = ete ae (FE, + S fare ARAS) , (2.24) 


Lis 6 a,K", (2.23) 


analysis demonstrates that K” is a singular operator and that its divergence cannot 
be summarily discarded as was done in electrodynamics. This is a curious situation 
because the 0-term is CP-violating. Thus, one is faced with the specter of large 
CP-violating signals in the strong interactions. Yet such effects are not observed. 
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Indeed, it has been estimated that the 6-term generates a nonzero value for the 
neutron electric dipole moment de(n) ~ 5 x 107!° 6 e-cm, but to date no signal 
has been observed experimentally, d.(n) < 2.9 x 107%% e-cm at C.L. 90% [RPP 
12]. This provides the upper bound 9 < 5.8 x 10~!!. Perhaps Nature has dictated 
6 = 0, albeit for reasons not yet understood. 


QCD to one loop 


To develop Feynman rules for QCD, we must first obtain an effective lagrangian 
which properly addresses the issue of SU(3) gauge freedom. For the U(1) gauge 
invariance of QED, this was accomplished by adding a gauge-fixing term to the 
classical lagrangian. The situation for SU(3) is analogous, but somewhat more 
complicated due to its nonabelian structure. If we continue to use a Lorentz- 
invariant gauge-fixing procedure, the effective QCD lagrangian (for simplicity, 
consider just one quark flavor) can be expressed as 


1 E 1 
Locp = —7 Fu F" +WD — moD jade — TALA 
0 


dy ego" Ca + 83,0 fabe AF Wulo)Ce: (2.25) 


Bare quantities carry the subscript ‘0° and the field strengths and covariant deriva- 
tive are defined as in Eqs. (2.2a), (2.2b). The quantities {ca(x)} (a = 1,..., 8) are 
called ghost fields. As explained in App. A-5, they are anticommuting c-number 
quantities (i.e., Grassmann variables) which couple only to gluons. Ghosts occur 
only within loops, and never appear as asymptotic states. Each ghost-field loop 
contribution must be accompanied by an extra minus sign, analogous to that of a 
fermion—antifermion loop. Their presence is a consequence of the Lorentz-invariant 
gauge-fixing procedure. In alternative schemes such as axial or temporal gauge, 
ghost fields do not appear, but compensating unphysical singularities occur in 
Feynman integrals instead. 
The Feynman rules for QCD are 


three-gluon vertex: 
u,a v,b 

— 83,0 fabcl8uv (P = Qa + Burd = an p q 
F8au r — P)v] 


(2.26) 
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quark—gluon vertex: 


f às 
—123,0(Ywap z 
jk 


four-gluon vertex: 


—i g3 o [Cave fede(8ua8vo _ Spo v1) 
+ face JS nde Suv Bie = Euogva) 
+ fade IS che Sui Bue = Sur8rc)| 


ghost—gluon vertex: 


— 83,0 fabclu 


quark propagator i 3 (p): 


iô jx (P + Mo)ap 
p — m +ie 


gluon propagator i D® (q): 


Hv 
idab q”q” 
o” 4 (1— 
1 g ( 50) 34 ie 
ghost propagator: 
idab 
p +ie 


(2.27) 
u,a À,C 
o,d 
(2.28) 
u,a 
D= gl 3 ee = C 
r 
(2.29) 
p 
B,k ——— oj 
(2.30) 
q 
vb ANNANN ua 
(2.31) 
p 
b— — —» — -a 


(2.32) 
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The above rules involve a total of four distinct interaction vertices. Of these, 
the three-gluon and four-gluon self-vertices, and the ghost-gluon coupling have no 
counterpart in QED. That all four vertices are scaled by a single coupling strength 
23 is a consequence of gauge invariance. Also, chromodynamics exhibits a certain 
coupling-constant universality, called flavor independence, in the quark—gluon sec- 
tor. All fields which transform according to a given representation of the SU (3) of 
color have the same interaction structure, e.g., all triplets couple alike, all octets 
couple alike but differently from triplets, etc. Quarks are assigned solely to the 
color triplet representation. Thus, the quark—gluon interaction is independent of 
flavor. 

The renormalization constants of QCD are 


AS = ZAR, 830 = ZZ; gs, 


iir 1/2 — 
y= Za y”, = Zi Z3 "gs, 
ct = Zy (cy, sfpl 2 "n (2.33) 
= PEDER 
Eo = 236, = ZZ, Z, 1 93, 


mo =m— êm, 


where the quantities Z1, Zı, Zir, and Z4 are defined by the above coupling, con- 
stant relations and can be determined from Z2, Z3, and Z3. In the following, work- 
ing in & = 1 gauge we shall compute the one-loop contributions to the gluon 
self-energy and to the quark—gluon vertex, and, by absorbing the €-poles, thereby 
obtain expressions for Z3 and Zıp to leading order. Determination of the remain- 
ing renormalization constants, which can be computed from loop corrections to 
the quark and ghost propagators and the three-gluon, four—gluon, and ghost-gluon 
vertices will be left as exercises. However, it is clear from the definition of g3.9 in 
Eq. (2.33) that the relations 


= (2.34) 


must hold in any consistent renormalization scheme. These are the analogs of the 
Ward identities in QED. Physically, they ensure that the coupling-constant relations 
which appear in the QCD lagrangian (as a consequence of gauge invariance) are 
maintained in the full theory. 

The QCD one-loop contribution to the quark—antiquark vacuum polarization 
amplitude of Fig. I-4(a),° 


6 To avoid notational clutter, we shall not put subscripts on the bare coupling for the remainder of this 
subsection. 
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(a) (b) (c) 


Fig. II—4 One-loop corrections to the gluon propagator: (a) quark—antiquark pair, 
(b) gluon pair, and (c) ghosts. 


. 2 4 
: råb = —183 d Pp 
illo 4] quark a ( 2 ) (27)* 


x Tr 0% | (2.35) 


i 
Wya — ~ 
je 1? ae ee 


i 
p—m+ 
differs from the QED self-energy only by the group factor (°); (AP) = 
Tr (A70") = 28% (cf. Eq. (2.7)). Comparing with Eq. (1.39), we obtain 


2\€ 2 
“Ta . sa H g3 1 
iTe (a) | vare = 18” (qag — Bapg) (4) | Ba ahi | ; (2.36) 


This must be multiplied by the number of quark flavors n ¢ which contribute in the 
vacuum polarization loops. 

The contribution from the gluon—gluon intermediate state of Fig. I—4(b) can be 
written 


1 d*k Ne 
1% = ~(-i a) an 2.37 
AD g = 9) (20)* [k2 + iell(q — k)? + ie] a 
with 
NES = 93h L—gpu(g + Kv + 8u (2k — Gp + 8p(2g — Ku 
x 83 f Tet (q tk” +g” lq —2k)a + g”, (k — 2q)"1. (2.38) 


The prefactor 1/2 in Eq. (2.37) is a Feynman symmetry factor associated with the 
identical intermediate-state gluons. To arrive at this expression, special care must 
be exercised with momentum flow in the three-gluon vertices. Upon extending the 
integration to d dimensions and using Eq. (2.15) to evaluate the color factor, we 
obtain 


dk Nag 
(27)! [k? + ie][(q — k)? + ie] 


1 
i123 () | gruon = —ZC2(B)8 Bs u (2.39) 


gluon 
with 


Nop = (—5q? + 2q - k — 2k*) gap + (6 — d)quag 
+ Od = 3) Guks Hapka) + 6 4D ke. (2.40) 
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Integration of Eq. (2.39) a 


11°54) loan = =i pN TH 9 lta 
L aß q) [gluon = a 2g? 3 z 14g 6 Eap4 Ie 
(2.41) 


The final contribution to the gluon propagator is the ghost-loop amplitude of 
Fig. U-4(c), 


Teo =- d'k i 
: q ghost (27)t (k — q)? i ie 
x [g3 f°" (k — Delo =e [S ka ]. (2.42) 


The bracketed quantities arise from the P vertices, and the minus 
prefactor must accompany any ghost loop. Following the standard steps to a 
d-dimensional form, we arrive at 


d¢k ka(k — q)p 


2 any 
—g3; C2(8)d (27)? [(k — q)? +ie][k? +ie]’ 


: qab 
i Tiag (A)| onos = (2.43) 


which becomes to leading order in €, 


iTe (q) 


2 € 
E k l l 2| 1 
‘oe ia -8u —. (244 
ghost 167 702 (8) (5 —q? :) È qg ai 6° BY | Je ( ) 


The sum of gluon and ghost contributions takes the gauge-invariant form 


2 € 
a ab 83 S/n 23 1 
ill “Daten = —iô Br z C2(8) (4 =) [daag — 8up4 ] 7 +... 


(2.45) 
Finally, adding the quark contribution for n ș flavors gives the total result 
2 2 \*72n 3 
11° (q) = i8 (da — ggg E 4 fee | 
ap (4) = 18 (dadp — apq en q 3 3 2(8)| t 
(2.46) 
Renormalizing at q? = — u, we find’ 
efun [ony 
Z= 1- 2 C28) = O(g3). (2.47) 
87 HR 3 
Proceeding next to the quark-gluon vertex, written through first order as 
. 8&3 a . §3 a 4 a 
= awi (p2, Pi) = = a )ji = ig3(A}) ji(p2; pP) +, (2.48) 


7 For notational simplicity, we discontinue displaying the superscript (MS) on renormalization constants. 
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Fig. II-5 One-loop corrections to the quark—gluon vertex. 


we see from Fig. II—5 that there are radiative corrections from both quark and gluon 
intermediate states. The quark contribution is 


‘ 3 4 . aß 
j a —183 dtk —ig 
LA a ( 2 ) Ory Bie Ore 
i i 
x Any — ive. (2.49 
ray eee )ni Y leken" Jiyg- (2.49) 


Aside from the replacement e —> g3 and a color factor A?A1A? /8, which is eval- 
uated in Eq. (2.17), the remaining expression is the QED vertex, which will be 
analyzed in detail in Sect. V-1. Thus we anticipate from Eq. (V—1.19) that at 
Pi = P = p and |p? > m’, 


1 2 j 2\¢ 
[AS(p, p)ljil = (C28) — 5028) S (+ ) (A7/2)jivy +. 


quark 8m2 Je —p? 
(2.50) 
The two-gluon intermediate state, which has no counterpart in QED, has the form 
3 4 
: a a c3 b §3 d'k B a 
—igs(A, (P2, P1)) il gtuon = i fabe AA?) ji af Om” (k +m)y 


E 8vp(2p2 — k — P1)a + 8pa (2k — pı — P2)v + 8av(2 P1 — k — pr) 

[Ae — m? + tel — K? de] po = k) + ie] l 

By a now-standard set of steps, it is not difficult to extract the €-pole from the 
extension of the above to d dimensions, and we find 


(2.51) 


A‘ = 04/2) an 2Cx8) 2 (HY 3 2.52 
( ICP, P))ji lefts = ( / )ji Yv 2 2( ) 8m2 —p? Ve + Ae ( . ) 
implying a total vertex correction of the form, 
AP, Diha = 0D (C208) + C28) (AL) J 
v\P> P)) Jil tor = jiYv L-2 827 \ — p? 2€ 


(2.53) 


We thus determine the renormalization constant for the quark-gluon vertex at p? = 
— u4 to be 
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7 2€ 
DE 83 H 1 
Zip = 1 — [C2(3) + C2(8)] z apersi (2.54) 
877 \ UR 2€ 


There remains the task of determining Z2. We shall leave this for an exercise 
(cf. Prob. II-3) and simply quote the result 


2 2€ 
1 
Z=1- 08L |” EN (2.55) 
87? \ UR 2€ 


Asymptotic freedom and renormalization group 


A striking property of QCD is asymptotic freedom [GrW 73a,b, Po 73]. This is the 
statement that, unlike the electric charge, the coupling constant g3(upr) of color 
decreases as the scale of renormalization up is increased. To demonstrate this, we 
first combine our results for Z;, Z2 and Z3 to obtain the coupling renormalization 
constant Z,, 


_15-1/2 
83,0 = ZIF Z3 "Ze l g3 = Zg83, 


a 2n p ee 
Z, 21 =) 11 T 2.56 
s a ( 3 (£) T vaa 


where œs = 83 / (47). From the e~! coefficient of Z g» we learn that 


g3 
167? 


11 
ur =- Rew = Hog] + O(83), (2.57a) 
R 


or equivalently, 


ocn = — (u = zr) 83 + O(g3) = —Bo 83 + O(g3). (2.57b) 

167? 3 167? i 
The sign of the leading term in Bgcp is negative for the six-flavor world np = 6, 
becoming positive only if the number of quark flavors exceeds 16. As we have 
already seen, the QED vacuum acts as a dielectric medium with dielectric con- 
stant €gzp > 1 because spontaneous creation of charged fermion—antifermion 
pairs results in screening (i.e., vacuum polarization) of electric charge. The dielec- 
tric property €gzp > 1 means that the QED vacuum has magnetic susceptibility 
Lorp <1, and thus is a diamagnetic medium. The QCD vacuum is the 
recipient of similar effects from virtual quark—antiquark pairs, but these are over- 
whelmed by contributions from virtual gluons. As a result, the QCD vacuum is 
a paramagnetic medium (f4gcp > 1) and antiscreens (€gcp < 1) color charge 
[Hu 81]. 
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The effect of asymptotic freedom can be displayed most clearly by performing 
a renormalization group (RG) analysis on the 1 PZ amplitudes of the theory. A 
connected? renormalized Green’s function is defined in coordinate space as 


gerat = (OT (T GA a ea (2.58) 


where the numbers of quark and gluon fields are np, ng, respectively, and for 
convenience we suppress color and Lorentz indices. We employ the same symbol 
G"¥-"8) for the momentum Green’s function 


(270)*8*(pi +++ + paG ({p}) = J [kiy n 
j=l 


(2.59) 


where n = np + ng. The 1PI amplitudes T”F:”2) are obtained by removing the 


external-leg propagators from G9”, 


Gr” =] Pe [sepr d []P@ [se 260 
r T i j 


where unprimed (primed) momenta represent initial (final) states. The relations of 
Eq. (2.33) imply for any renormalization scheme, which we need not specify yet, 
that 


Gorn) — A E 
D = Z;' Do, S = Z3 ' So, (2.61) 
where the zero subscript denotes unrenormalized quantities. From this, we have 
porns) — Yas Aaa ERB (2.62) 
and the combination of terms 


2°"? (ur) Z3” (ur) POP" ({p}, gs(ur), mur), F(R); ur) — (2.63) 


is therefore independent of the renormalization scale upg. 

Let us now ascertain the behavior of “*:"*) in the deep Euclidean kinematic 
limit where all momenta {p} are both spacelike (in order to avoid singularities) and 
very large compared to any other mass scale in the theory. To keep the situation 
as simple as possible, we omit the dependence of T ”F:”2) on both the quark-mass 
m(ur) and gauge £(upr) parameters.” Then from Eq. (2.59) we find in response to 
a scale transformation p — ip that 

8 All the fields participating in a connected Green’s function are affected by interactions; in a disconnected 


Green’s function, one or more of the field quanta propagate freely. 
9 We shall define a ‘running mass parameter’ later, in Chap. XIV. 
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GO") Ap}, galar); pr) = AOE? GOED) Cy}, galur); R/d)- (2.64) 


This behavior is almost that of a homogeneous function occurring in a scale- 
invariant theory. Canonical dimensions of the fields appear in the exponent of the 
scaling factor along with an additive factor of four arising from the four-momentum 
delta function in Eq. (2.59). However, in G“""-"®), there is also an implicit depen- 
dence on A due to the presence of the renormalization scale upg. The corresponding 
scaling property of the 1 PZ amplitude is found from Eqs. (2.63), (2.64) to be 


TOP") Ap}, g3(uRr); pp) = AB r2 perne) p}, g3(uRr); UR/A) (2.65) 


or 


Zur) \ "8? 
ror") ((Ap}, ga(ue)s He) = Amen? ( a a) 
Z3(uR) 
F (owe 
Za(uR) 
This functional relationship can be converted to a differential RG equation by tak- 
ing the A-derivative of both sides and then setting à = 1, 


—nF/2 
) Torne) p}, g3(À uR); UR). (2.66) 


~ ð 3 ð 
X pi +ng(l + ye) +nr =+ yr ) — 4- boco— Jr" =0, 
F api 2 0g3 
(2.67) 


where 


=e Ne, p= eZ)? (2.68) 
OLR OUR 
are called the anomalous dimensions of the respective fields and Bgcp is as in 
Eq. (2.57). 
Let us now see how to obtain leading-order estimates for the above anomalous 
dimensions. To this order, the result for Bgcp is both gauge and renormalization 
scheme-independent. To start, we can use the result of Eq. (2.55) to determine yF, 


1 ə ln Z2 g3 4 
= 3 C3 O(g3), 2.69 
aR i602 2(3) + O(g3) (2.69) 


and analogously for yg. To solve the RG equation, we employ the variable t = In À, 
where A is the scaling parameter appearing in Eqs. (2.64)—(2.66), and introduce the 
running coupling constant g3(t), 


- 
a = BR), B0) = g. (2.70) 
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Then it is straightforward to verify that the solution to Eq. (2.67) is 


(fe p}, g3(4r)s Mr) = OFFI DAT Ap}, B30); ur), (2.71) 


where 
D(t) = exp (- / dt’ [nsys (83(t’)) + neve (83(t’)) | ) (2.72) 


is the anomalous dimension factor. The scaling behavior of the 1PZ amplitude 
is seen to have field dimensions with anomalous contributions in addition to the 
canonical values. 

Despite naive expectations, the interaction strength at the scaled momentum is 
not the constant g3, but rather the running coupling constant g, whose magnitude 
decreases as the momentum is increased. Employing the lowest-order contribution 
for Bgcp in Eq. (2.57b), we can integrate Eq. (2.70) over the interval 4 < t < t to 
obtain 


(83(t2))-* — E361)? = 2 (11 — 2n¢/3) (h — t1)/167°, (2.73) 


where n is the number of quark flavors having mass less than ,/h. It is conven- 
tional to express this relation in a somewhat different form. Defining a scale A at 
which g, diverges and letting a,(q7) = z (q°)/47, we have to lowest order, 


4r 1 


127/3) iIn@?/A) (2.74) 


as q’) = 
where n p is the number of quark flavors with mass less than Vg. Higher order 
contributions are discussed at the end of this section. 

If œ, (q°) continues to grow as q° is lowered, any perturbative representation of 
Bocp ultimately becomes a poor approximation, and we can no longer integrate 
Eq. (2.70) with confidence. Although unproven, a popular working hypothesis 
is that the QCD coupling indeed continues to grow as the energy is lowered, 
leading to the phenomenon of quark confinement. In QED, the free parameter 
a(q = 0) = 1/137 is quite small and expansions in powers of œ converge rapidly. 
However QCD behaves differently. In particular, it is clear from Eq. (2.74) that as 
is not really a free parameter, but is instead inexorably related to some mass scale, 
e.g., A. This phenomenon, called dimensional transmutation, means that an energy 
such as A can effectively serve to replace the dimensionless quantity a, in the for- 
mulae of QCD. Specifying QCD operationally requires not only a lagrangian but 
also a value for A. For example, QCD perturbation theory is useful only if ‘large’ 
mass scales M (i.e. those with (A/M)? < 1) are probed. Because the complex- 
ity of low-energy QCD has thus far prevented direct analytic solution of the theory, 
there have been substantial efforts to develop alternative approaches. These include 
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Table II-2. Determinations of a;(Mz). 


Experiment q [GeV] ats (q7) as (Mz) 

t decays 1.777 0.330 + 0.014 0.1197 + 0.0016 
DIS [F>] 2—> 15 =-= 0.1142 + 0.0023 
DIS [e+ p — jets] 6 — 100 SS 0.1198 + 0.0032 
QQ states 7.5 0.1923 + 0.0024 0.1183 + 0.0008 
Y decays 9.46 Oe 0.1190+9:006 

ete jets & shapes 14 —> 44 --- 0.1172 + 0.0051 
ere” [ew] 91.17 0.1193 + 0.0028 0.1193 + 0.0028 
ete jets & shapes 91 — 208 === 0.1224 + 0.0039 


attempts to solve QCD numerically (lattice-gauge theory), phenomenological study 
of various theoretical constructs (potential, bag, Skyrme models), exploitation of 
the invariances contained in Locp (notably chiral and flavor symmetries), and con- 
sideration of the infinite color limit Ne — œo as a first approximation to QCD 
(NZ! expansion). The first of these topics is beyond the scope of this book (e.g. see 
[GaL 10, DeD 10]), but the others will form the basis for much of our discussion. 

Attempts to infer w,(q7) from experimental data are typically carried out under 
kinematic conditions for which a perturbative analysis of QCD presumably makes 
sense. Systems commonly used for this purpose include decays of the t lepton, 
deep-inelastic scattering (DIS) structure functions, Y decay, and hadronic event 
shapes and jet production in e*e~ annihilation. Suppose, as is generally the case, 
a given process is computed to some order in QCD perturbation theory and reg- 
ularized in the MS scheme. If such a theoretical expression is then used to fit the 
data with a q-value characteristic of the given process employed, an expression 
such as Eq. (2.74) can be used to determine A and a,(q7) can be evolved to dif- 
ferent g. Since this operation depends on both the regularization procedure and 
the number of quark flavors n+ used in Eq. (2.74), a notation like A 
precise. Unfortunately there is no uniformity in the rate of convergence of QCD 
perturbation theory from process to process. Thus, determinations of œ,(q°) are 
affected by both theoretical and experimental uncertainties, and a scatter of quoted 
values results. Nonetheless, an impressive consistency now exists between determi- 
nations carried out for a variety of conditions. Table II—2 lists values of a,(Mz) as 
inferred from a diverse set of experimental inputs [Be 09], and Figure II-6, which 
has attained the status of a QCD icon, displays the same. The current world average 
at the Z-boson mass scale is [Be et al. 11] 


would be 


as(Mz) = 0.1184 + 0.0007, (2.75) 
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Fig. Il-6 Energy dependence of œs (Q), from [RPP 12] (used with permission). 


which implies the value A = 213 + 8 MeV for the five-flavor sector of QCD. 
Determinations of œ, (q?) have been found to be qualitatively in accord consistent 
with the predicted q? dependence of QCD. Taken over the full range of available 
data, values in the range 0.2 < A(GeV) < 0.4 are not uncommon, e.g., AS = 
339 + 10 MeV and AX’ = 296 + 10 MeV as cited in [Be et al. 11]. 

To conclude this section, we briefly comment on the status of higher-order con- 
tributions to the running of the strong fine structure constant. To date, analytic 
calculations on a, (jz) have been performed up to the four-loop level, 


a 
as = —Boa? — Bia; — Bra} — pba? +, (2.76) 


2 


in which as = &@s/(47) is the expansion parameter and exact expressions for the 
coefficients Bo, 61, 62 and 63 appear in [RiVL 97]. The following useful approxi- 
mations are also provided, 


Bo X 11 — 0.666677 ; 

Bı = 102 — 12.66677 ; 

By = 1428.50 — 279.617 s + 6.01852n°%, 

P3 = 29243.0 — 6946.30n s + 405.08974 + 1.49931n%, (2.77) 


where as usual n ¢ denotes the number of active flavors. The four-loop running of 
a, can then be expressed as [ChKS 98], 
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ee 4r [i 7 Bi Int B?(n*t —Int — 1) + Boho 
Bot Bo t Bat? 
B? (In? t — 2.5 In? t — 2Int + 0.5) + 380818211 t — 0.58 3 
i | 
(2.78) 


where t = In(u% /A?). As an example, let us use this (taking ns = 5) to determine 
&s(u) at three mass scales involving respectively the b quark, the Z boson, and the 
Higgs boson, i.e., up = 4.18 GeV, uz = 91.1876 GeV and wy = 125.5 GeV, 


&æs (mp) = 0.2266, a;(Mz) = 0.1184, as;(My) ~ 0.1129. (2.79) 


These values reflect the behavior expected from asymptotic freedom, as discussed 
earlier.!° They will later be of use in discussing running quark mass (Chap. XIV) 
and Higgs-boson phenomenology (Chap. XV). 


II-3 Electroweak interactions 


The Weinberg—Salam—Glashow model [G1 61, We 67b, Sa 69] is a gauge theory of 
the electroweak interactions whose input fermionic degrees of freedom are mass- 
less spin one-half chiral particles. It has the group structure SU (2); x U(1), where 
the SU(2),, U(1) represent weak isospin and weak hypercharge respectively. The 
subscript ‘L’ on SU (2), indicates that, among fermions, only left-handed states 
transform nontrivially under weak isospin. 


Weak isospin and weak hypercharge assignments 


First, we shall discuss how the fermionic weak isospin (Tw, Tw3) and weak hyper- 
charge (Yw) quantum numbers are assigned. The fermion generations are taken 
to obey a ‘template’ pattern — we assume that each succeeding generation differs 
from the first only in mass. Thus, it will suffice to consider just the lightest fermions 
for the remainder of this section. The first-generation electroweak assignments are 
displayed in Table H-3. 

For weak isospin, experience gained from charged weak current interactions 
such as nuclear beta decay dictates that left-handed fermions belong to weak iso- 
doublets while right-handed fermions be placed in weak isosinglets, as in 


10 Using the exact relations for Bo, ..., ß4 yields the same results to the stated level of accuracy. 
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Table II-3. SU(2); x U(1) fermion assignments 


Particle Ty Ty3 Yw 
Ve,L 1/2 1/2 -1 
er 1/2 -1/2 =i 
Ve, R 0 0 0 
eR (0) (0) —2 
uL 1/2 1/2 1/3 
d, 1/2 =1/2 1/3 
UR 0 0 4/3 
dR 0 0 —2/3 
Ve 
leptons : lL = ( ) Ve,R ER, 
EJ 
u 
quarks : 4L = ( ) ur dpr. (3.1) 
dj; 


In view of nonzero neutrino mass, we include a right-handed neutrino. Each of the 
degrees of freedom displayed above must be assigned a weak hypercharge. There 
are a priori six in all,!! 


Yu) = Yq, Yur) = Yu, Y(dr) = Ya, 
Y(£L) =Y, Y(er)=Y¥., Y(vr) =Y.. (3.2) 


In the Standard Model one identifies the electromagnetic current, following spon- 
taneous symmetry breaking in the electroweak sector, by its coupling to the linear 
combination of neutral gauge bosons having zero mass. The electric charge Q car- 
ried by a particle is thus linearly related to the SU (2); x U(1)y quantum numbers 
T,y3 and Y,,, 


aQ = Tw + bYy, (3.3) 


where a, b are constants. We can use the freedom in assigning the scale of the 
electric charge Q to choose a = 1. At this point, let us not assume any knowl- 
edge of the fermion electric charge values. Ultimately, however, the left-handed 
and right-handed components of the charged chiral fermions must unite to form 
the physical states themselves. Consistency demands that the electric charges of 
the chiral components of each such charged fermion be the same, whatever value 
that charge might have. Using Eq. (3.3), we find 
11 The reason that weak hypercharge engenders so many free parameters in contrast to weak isospin lies in the 
difference between an abelian gauge structure (like weak hypercharge) and one which is nonabelian (like 
weak isospin). Thus all doublets have the same weak isospin properties irrespective of their other properties, 
analogous to flavor independence in QCD. For the abelian group of weak hypercharge, the group structure by 


itself provides no guidelines for assigning the weak hypercharge quantum number. Like the electric charge, it 
is a priori an arbitrary quantity. 
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1 1 1 1 
Vays aye Ba es ae 
q 14+ dp : Tap 2b 


Additional information is contained in axial anomaly cancelation conditions, to be 
discussed in detail in Sect. HII-3 (see especially Eq. (IM-3.60b) and subsequent 
discussion). In particular, the cancelation requirement implies the conditions 


(3.4) 


Tei yo = 0, (3.5a) 
ify, =0, (3.5b) 
Try? =0, (3.5c) 


where ‘Tr’ represents a sum over fermions and in Eq. (3.5a) F3 is the third generator 
of the octet of color charges. These constraints imply 


2Y, — Y, — Ya = 0, (3.6a) 
3Y, + Ye =0, (3.6b) 
2GY,° + Ye) -3(V,7 + Yr) - Y,° — ¥} =0, (3.6c) 


where the factors of ‘3’ are color related and the minus signs arise from chirality 
dependence of the anomalies. Then, insertion of Eq. (3.4) into Eqs. (3.6b), (3.6c) 
yields 


1 3 
(ou +5) -¥3=0. (3.7) 


If neutrinos are Majorana particles (i.e. identical to their antiparticles), then they 
cannot carry electric charge and by Eq. (3.3), one has Y, = 0. If so, Eq. (3.7) 
implies bY; = —1/2, which fixes the remaining Y; via Eq. (3.4). Thus, provided 
neutrinos are Majorana particles, once the weak isospin is chosen as in Eqs. (3.1), 
(3.2) and all possible chiral anomalies are arranged to cancel, one obtains a pre- 
diction for the fermion electric charge. We also learn that any attempt to determine 
weak hypercharge values from the known fermion electric charges is affected by 
an arbitrariness associated with the value of ‘b’. This accounts for the variety of 
conventions seen in the literature. For definiteness, we have taken b = 1/2 
in Eq. (3.3) and thus the relationship among the various quantum numbers in 
Table H-3 is 


Yw = 2(Q — Tw3). (3.8) 


On the other hand, if neutrinos are Dirac particles, it follows from Eq. (3.4) that 
Eq. (3.7) becomes a trivality and we learn nothing of weak hypercharge assign- 
ments from anomaly cancelation arguments. In this instance, one assigns the weak 
hypercharge by inserting the observed fermion electric charges into Eq. (3.8). The 
ability to predict {Q;} values has been lost. 
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SU(2), xU(Dy gauge-invariant lagrangian 


Having assigned quantum numbers, we turn next to the electroweak interactions. 
The Weinberg—Salam lagrangian divides naturally into three additive parts, gauge 
(G), fermion (F), and Higgs (H), 


Lws = Lg + Lr + Ly. (3.9) 


Throughout this section we shall concentrate on establishing the general form of 
the electroweak sector, referring at times to only a few tree-level amplitudes. We 
shall return in Chap. V to the subject of electromagnetic radiative corrections, and 
present the electroweak Feynman rules along with various radiative corrections in 
Chap. XVI. 

The gauge-boson fields, which couple to the weak isospin and weak hypercharge 
are, respectively, W u = (Wi, Wi W3) and B,,. These contribute to the purely 
gauge part of the lagrangian as 


1 1 


Lo= = Fi = gB Bw» (3.10) 
where F a (i = 1,2,3) is the SU (2) field strength, 
Fi, = ðpWi — 3,Wi, — gz WIW, (3.11) 
and B,,, is the U (1) field strength, 
Buy = 3p Bu — dy By. (3.12) 


The fermionic sector of the lagrangian density includes both the left-handed and 
right-handed chiralities. Summing over left-handed weak isodoublets yz and right- 
handed weak isosinglets yr, we have 


Lr=}_ ViP vi + Do VRiD ve. (3.13) 
WL WR 


Since a right-handed chiral fermion does not couple to weak isospin, its covariant 
derivative has the simple form 


Dur = (8, + iF Ya Bur. (3.14) 


This expression serves to define the U (1) coupling g1. Its normalization is dictated 
by our convention for weak hypercharge Y,,. The corresponding covariant deriva- 
tive for the SU (2), doublet wy, is 

T> 


Piy) + isa Wy) Yi, (3.15) 


DW. = (1 (a, n is 


given in terms of the SU (2) gauge coupling constant g> and the 2 x 2 matrices I, T. 
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We shall not display the quark color degree of freedom in this section for reasons 
of notational simplicity. However, it is understood that all situations in which quark 
internal degrees of freedom are summed over, as in Eq. (3.13), must include a color 
sum. Similarly, relations like Eq. (3.14) or Eq. (3.15) hold for each distinct internal 
color state when applied to quark fields. 

The above equations define a mathematically consistent gauge theory of weak 
isospin and weak hypercharge. However, it is not a physically acceptable electro- 
weak theory of Nature because the fermions and gauge bosons are massless. 
A Higgs sector must be added to the above lagrangians to arrive at the full 
Weinberg—Salam model. Thus, we introduce into the theory a complex doublet 


: 
p = a (3.16) 


of spin-zero Higgs fields with electric charge assignments as indicated. The quanta 
of these fields then each carry one unit of weak hypercharge. The Higgs lagrangian 
Ly is the sum of two kinds of terms, Lyg and Lyr, which contain the Higgs— 
gauge and Higgs—fermion couplings respectively. The former is written as 


Lug = (D")*D,,® — V(®), (3.17) 
where 
.§1 _ Tt > 
D,® = (1[3,+788,] +1805 tH) ®, (3.18) 


and V is the Higgs self-interaction, 
V(®) = —p’®'® + A(0")’. (3.19) 


The parameters u? and à are positive but otherwise arbitrary. For simplicity, we 
write the Higgs—fermion interaction in this section for just the first generation of 
fermions. Denoting the left-handed quark and lepton doublets respectively as gz 
and £z, we have 


LHF = — 9G, Pup = gaq, Pdr = golL OveR = gel, Per +h.c., (3.20) 


where the coupling constants g,, &4, 8e and g, are arbitrary and we employ the 
charge conjugate to ®, 


= ind". (3.21) 


In a sense the Higgs potential V and Higgs—fermion coupling Lyp lie outside 
our guiding principle of gauge invariance because neither contains a gauge field. 
However, there is no principle which forbids such contributions, and their presence 
is phenomenologically required. Moreover, note that each is written in SU(2); x 
U (1) invariant form. 
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Spontaneous symmetry breaking 


Mass generation for fermions and gauge bosons proceeds by means of spontaneous 
breaking of the SU(2), x U(1) symmetry. To begin, we obtain the ground-state 
Higgs configuration by minimizing the potential V to give 


(—u? + 2A0'®) = 0. (3.22) 


We interpret this ground-state relation in terms of vacuum expectation values, 
denoted by a zero subscript. Eq. (3.22) has two solutions, the trivial solution 
(®)o =0 and the nontrivial solution, 


v? 


(D D)o = z (3.23) 
with 


2 
v= o (3.24) 


Let us consider the latter alternative. A nontrivial vacuum Higgs configuration, 
which obeys the constraint Eq. (3.23), respects conservation of electric charge, 
and describes the spontaneous symmetry breaking of the original SU(2);, x U(1) 
symmetry is 


0 
(®)y = (, vA: (3.25) 


In one interpretation, it is the order parameter for the Weinberg—Salam model, play- 
ing a role analogous to the magnetization in a ferromagnet. Group theoretically, it 
is seen to transform as a component of a weak isodoublet. The energy scale, v, of 
the effect is not predicted by the model and must be inferred from experiment. 
The fermion and gauge-boson masses are determined by employing Eq. (3.25) 
for the Higgs field everywhere in the lagrangian Cy. We first define charged 
fields Wis 


1 
W,” = [o + iW;). (3.26) 


corresponding to the gauge bosons W~. By substitution, we find for the mass 
contribution to the lagrangian 


”(g,iiu + gadd + 2.2 182)" y+ yy 
T petta + geze) + (=) „u W- 


v? g ~g1g wl 
— (W? B 2 lo ao 3:27 
t3 hare e? gi BY een 


ne = 
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The fermion masses are given by 
wv 
mf = W 
Although the theory can accommodate fermions of any mass, it does not predict 
the mass values. Instead, the measured fermion masses are used to fix the arbitrary 
Higgs-fermion couplings. The charged W-boson masses can be read off directly 
from Eq. (3.27), 


Ef (f = 4, d, e, we) (3.28) 


v 
Mw = 7 82; (3.29) 


but the symmetry breaking induces the neutral gauge bosons to undergo mixing. 
Their mass matrix is not diagonal in the basis of W?, B states. Diagonalization 
occurs in the basis 

Zu = COS Ow wi — sinw By, 

Ay = sin Oy wW + cos Ow By, (3.30) 


where the weak mixing angle (or Weinberg angle) 0w is defined by 


tan 0w = —. (3.31) 
§2 


The neutral gauge-boson masses are found to be 


v 
M,=0, Mz= ya + 33, (3.32) 


and the fields A,, and Z, correspond to the massless photon and massive Z°-boson, 
respectively. Observe that the W*-to-Z° mass ratio is fixed by 


Mw 
— = cos øy. (3.33) 
Mz 

Electroweak currents 


Now that we have determined the mass spectrum of the theory in terms of the input 
parameters, we must next study the various gauge—fermion interactions. The tradi- 
tional description of electromagnetic and low-energy charged weak interactions of 
spin one-half particles is expressed as 


Lin = —eA, JE — ala Ia (3.34) 


where Jim is the electromagnetic current 


em 


2 i 
Jem = —ey"e + suy"u— zdy"d +- , 2:29) 


Ji; is the charged weak current (ignoring quark mixing) 
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JG, = Vey" + ysle+uy"(l+ys)d +--+, (3.36) 


and Gp ~ 1.166 x 10~> GeV” is the Fermi constant (cf. Sect. V—2). 
Alternatively, we can use Eqs. (3.13)—(3.15) to obtain the charged and neutral 
interactions in the SU(2); x U(1)y description, 


£ int = (8B 


where J’, is the third component of the weak isospin current, 


E (WEIK + WIIK) — go Warts — gB h- Jk), 837) 


> — Tt 
Je =} iy sh (3.38) 
WL 


summed over all left-handed fermion weak isodoublets. Substituting for B, and 
Wi in Eq. (3.37) in terms of A, and Z, yields 


C= E- = (WJA +W, I) — 81008 Ow Ap Jim + Lati—wk: (3.39) 


where J} ch = =a J 4i2 is given in Eq. (3.36) and the neutral weak interaction Lyy_wx 
for fermion f is!* 


(f) 
Lai-wk = Ses a a Tey Vet oe Va Ee 
ear? —rsinty of, Par? GA) 
Specifically, we have for the vector and axial-vector couplings 
(e,,T) +2 (e,u,T) 1 
gy an + 2sinf Ow, 8ga = 
4 1 
agi) c 3 sin? Ow, gi . D = a? 
12 1 (3.41) 
d,s,l “2 d,s,b 
gy Sb = sin Ow, gose = T7 
es ia a 1 es PrE 1 
vem S gn = z. 


Observe the structure of the neutral weak couplings Pane If 6, were to vanish, neu- 
tral weak interactions would be given strictly in terms of Tw3, the third component 
of weak isospin. However in the real world, phenomena like low-energy neutrino 


12 One should be careful not to confuse Eq. (3.40) with the alternate form 


CD e = eZ Top ye +a fyuy wy, 


TP = 2sin? 6w OF m T% 


oF = 2 sin Ow cos Ow 2 sin Ow cos Ow” 


which also appears in the literature. 
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interactions, My/Mz, deep inelastic lepton scattering data, etc. all depend on the 
value of Oy. In addition, we note that because sin? 0y ~ 0.23, the leptonic vector 
coupling constants grr D are substantially suppressed relative to the axial-vector 
couplings. 

Comparison of Eq. (3.34) with Eq. (3.39) yields 


e = g) cosy = go sin Oy. (3.42) 


The Fermi interaction of Eq. (3.34) corresponds in the Weinberg—Salam model to 
a second-order interaction mediated by W-exchange and evaluated in the limit of 
small momentum transfer (1 >> q?/M;j,), 


2 
Z =; A (3.43) 
Together, these relations provide a tree-level expression for the W-boson mass, 
% eee as Ee ov)" Gad 
sin’ Ow /2G F sin Oy 
Also, Eqs. (3.29), (3.43) imply 
v = 274G; ~ 246.221 (2) GeV. (3.45) 


It is the quantity v which sets the scale of spontaneous symmetry breaking in the 
SU (2), x U (1) theory, and all masses in the Standard Model are proportional to 
it, although with widely differing coefficients. 

We shall resume in Chaps. XV, XVI discussion of a number of topics introduced 
in this section, among them the Higgs scalar, the W* and Z° gauge bosons, and 
phenomenology of the neutral weak current. Also included will be a description of 
quantization procedures for the electroweak sector, including the issue of radiative 
corrections. First, however, in the intervening chapters we shall encounter a num- 
ber of applications involving light fermions undergoing electroweak interactions 
at very modest energies and momentum transfers. For these it will suffice to work 
with just tree-level W* and/or Z° exchange, and to consider only photonic or glu- 
onic radiative corrections. We shall also neglect the gauge-dependent longitudinal 
polarization contributions to the gauge-boson propagators (analogous to the gq" 
term in the photon propagator in Eq. (1.18)), as well as effects of the Higgs degrees 
of freedom. For photon propagators, the gq” terms do not contribute to physi- 
cal amplitudes because of current conservation. Although current conservation is 
generally not present for the weak interactions, both the q”q” propagator terms 
and Higgs contributions are suppressed by powers of (m/My)* for an external 
fermion of mass m p. 
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II—4 Fermion mixing 


In our discussion of the Weinberg—Salam model, we limited the number of fermion 
generations to one. We now lift that restriction and consider the implication of 
having n generations. Although the existing experimental situation supports the 
value n = 3, we shall take n arbitrary in our initial analysis. 


Diagonalization of mass matrices 


To begin, it is necessary to generalize the Higgs—fermion lagrangian Lyr of 
Eq. (3.20) to 


=Lir = gr Gi «Pug g gg a Gia Pp p F gor Lj (Peg 
+ gf lf eg, + hc., (4.1) 


where we employ the summation convention œ, = 1,...,n, and adopt the 
notation 


=(u',c',t’,...), 


5 
u 

ED, Piet a9 

d =(d>,8 "Bo ra) 
“= Wn P; Vereen 


eSB st ined: 


(CEE) 
OC) 


Observe that we denote the individual neutrino flavor eigenstates as ve, Vu, Vr, with 
no primes. The states which appear in the original gauge-invariant lagrangian are 
generally not the mass eigenstates. That is, there is no reason why the n x n gen- 
erational coupling matrices g,, g4, Zv, Ze should be diagonal. Following sponta- 
neous symmetry breaking, we obtain the generally nondiagonal n xn mass matrices 
m,, mj, mj, mý from the analog of Eq. (3.28), 


u?’ v? e 


v 

m; = BE (f =u,d,v,e). (4.3) 
Although not diagonal in the flavor basis, these matrices can be brought to diag- 
onal form in the mass basis. The transformation from flavor eigenstates to mass 
eigenstates is accomplished by means of the steps 
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121 = Fr 27 121 a 121 
m „ig + d; mgdg + Lm, Ve + e€ Mer + h.c., 


Et 


—Lr, mass 


= 


7, Ststim sts“ af + d. StS: 'mJS¢84" di 
D, S¥S)'m gt Bh + Z; SES m/SES$ é + hic. 
—i, m, De he Dee ake pian e Ax erate Ge ees 


—im, i i E EA . (4.4) 


The n x n unitary matrices ST p (a =u, d, v, e) relate the basis states, 


and induce the biunitary diagonalizations 
m, = S m, S%’, (a =u,d, v,e), (4.6) 


thus yielding the diagonal quark mass matrices 


m 0 0 ... mı 0 0 
O m 10. ... 0 m O0 az 
m=] 0 0 m ...J> ™=[0 0 m ...|: (4.7a) 


and the diagonal lepton mass matrices 


my, 0 0 Hered Me 0 (0) 
0 my 0 ane (0) My 0 
m=] 0 0 ms n me=| O 0 m, om) 


Although the Weinberg—Salam model is first written down in terms of the flavor 
basis states, actual calculations which confront theory with experiment are per- 
formed using the mass basis states. We must then transform from one to the other. 
This turns out to have no effect on the structure of the electromagnetic and neu- 
tral weak currents. One simply omits writing the primes, which would otherwise 
appear. The reason is that (aside from mass) each generation is a replica of the oth- 
ers, and products of the unitary transformation matrices always give rise to the unit 
matrix in flavor space. Thus, at the lagrangian level, there are no flavor-changing 
neutral currents in the theory. 
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As an example of this, consider the leptonic contribution to the electromagnetic 
current, 


Jem (lept) = ~En y eu = EL aV" eLa — Ra ERa 
= — G18 )a V" (Sper)a — (ErSR ay” (SRer)a 
= —êL aY" EL a — ËR aY" ERa = —aY" ea: (4.8) 
where we sum over family index œ = 1, ..., n and invoke the unitarity of matrices 


S4 g- Note that there is no difficulty in passing the Sf g through y” because the 
former matrices act in flavor space whereas the latter matrix acts in spin space. 


Quark mixing 


Thus far, the distinction between flavor basis states and mass eigenstates has been 
seen to have no apparent effect. However, mixing between generations does 
manifest itself in the system of quark charged weak currents, 


Ji (qk) => QUT Var a => 2üL ay" VagdL,p, (4.9) 
where 
V = S# Ss. (4.10) 


The quark-mixing matrix V, being the product of two unitary matrices, is itself 
unitary. The Standard Model does not predict the content of V. Rather, its matrix 
elements must be phenomenologically extracted from data. For the two-generation 
case, V is called the Cabibbo matrix [Ca 63]. For three generations, it has been 
referred to as the Kobayashi-Maskawa (KM) matrix [KoM 73] after its originators, 
but is now usually denoted by the abbreviation ‘CKM’. We shall analyze properties 
of such mixing matrices for the remainder of this section. 

Ann x n unitary matrix is characterized by n? real-valued parameters. Of these, 
n(n — 1)/2 are angles and n(n + 1)/2 are phases. Not all the phases have physical 
significance, because 2n — 1 of them can be removed by quark rephasing. The 
effect of quark rephasing 


SAA «nd 
UL a > erur a, dia => edy u (a = 1, sey n) (4.11) 
on an element of the mixing matrix is 
Vag > Vage’ 08-20) =1 4.12 
ap ape (a, B = n’ N). (4. ) 


Since an overall common rephasing does not affect V, only the 2n — 1 remaining 
transformations of the type in Eq. (4.11) are effective in removing complex phases. 
This leaves V with (n — 1)(n — 2)/2 such phases. One must be careful to also 
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transform the right-chirality fields of a given flavor in like manner to keep masses 
real. If so, all terms in the lagrangian other than V are unaffected by this procedure. 

For two generations, there are no complex phases. The only parameter is com- 
monly taken to be the Cabibbo angle Oc and we write 


_ (ù cosc  sin6ce 
Yz (< sinc cos I ` a) 


A common notation for the n = 2 mixed states is 


(“") = v(“). (4.14) 
SC S 


Within the two-generation approximation, weak interaction decay data imply the 
numerical value, sin 6c ~ 0.226. 
The three-generation case involves the 3 x 3 matrix 


Vaa Vas Vab 
y= Vea Ves Veb , (4. 1 5) 
Via Vis Vib 


which is a form that emphasizes the physical significance of each matrix element. 
The n = 3 mixing matrix can be expressed in terms of four parameters, of which 
one is a complex phase. The presence of a complex phase is highly significant 
because it signals the existence of CP violation in the theory. We shall return to 
this point shortly. The KM representation employs three mixing angles 012, 013, 023 
and a complex phase 6. It can be viewed as the following Eulerian construction of 
three matrices, 


1 0 0 C13 (0) spe? C2 s2 0 

V=ļ|0 C23 S23 0 1 0 =r]. C12 0 R (4.16) 
0 —S23 C23 — sıze’ 0 C13 0 0 1 

where Sag = SinOug, Cap = COSOap (a, E = 1,2,3). In combined form this 

becomes 

C12C13 S12C13 spe” 

V = | =se — csse? cnez — ssas? sgen |. (4.17) 
512523 — C12€03813e"° =523C19°= 512€73513e" €23C13 


By means of quark rephasing, it can be arranged that the angles {0,g} all lie in the 
first quadrant. In the limit 6.3 = 613 = 0, KM mixing reduces to Cabibbo mixing 
with the identification 0). = 8c. 

An alternative approach for describing the quark mixing matrix, the Wolfenstein 
parameterization [Wo 83], expresses the mixing matrix as the unit 3 x 3 matrix 
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together with a perturbative hierarchical structure organized by a smallness para- 
meter à. In the updated version, the Wolfenstein representation contains four para- 
meters A, A, p, n defined by 


seed, s3 =A), sge? = AX (p — in). (4.18) 


These definitions hold to all orders in A. Since many phenomenological applica- 
tions require accuracy to the level of order A>, we write 


ae r . 
ars À A” A(p — in) 
A215 , ao at 2 2 
V= 5 (1 — 2(p +in)) ee) vA 
3 = . = 2 AM ; Ah 
XA =p 79) —h At (l—2(p+in)) 1- 7 
(4.19) 


Observe that the matrix element Vig is expressed in terms of 6 = p(1 — A7/2) 
and 7 = n(1 — A*/2). These quantities, which are useful in generalizing the so- 
called unitarity triangle (cf. Sect. XIV-5) beyond leading order, are directly cited 
in modern fits of the CKM matrix. 

Attempts to theoretically predict the content of the CKM matrix have not borne 
fruit. The CKM matrix elements have come to be thought of as basic quantities, 
much like particle masses and interaction coupling constants. As such, each matrix 
element must be determined experimentally (with several experiments per matrix 
element). This endeavor, which has been a preoccupation of ‘Flavor Physics’ for 
many years, has finally reached an acceptable level of sensitivity, particularly with 
the operation of several B-factories (cf. Chap. XIV). Current values [RPP 12] for 
the Wolfenstein parameters are 


S020 as A= OSIA os 
p = 0.135+0031, ñ = 0.349+0015, (4.20a) 


Alternatively, we have for the original parameter set, 
+0.0008 0.0014 
S12 = 0.22577 0.0010> S23 = 00415 ouis: 


s13 = 0.00364 9:9004, 5 = (68.9139), (4.20b) 


Neutrino mixing 


Flavor mixing affects not only the quarks, but also the leptons, in the form of 
neutrino mixing. Just as the 3 x 3 quark mixing matrix V is associated with the 
acronym ‘CKM’, there will be a 3 x 3 lepton mixing matrix U for neutrinos. Its 
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standard acronym ‘PMNS’, acknowledges the early work of Pontecorvo [Po 68] 
and of Maki, Nakagawa, and Sakata [MaNS 62]. As we will show in Sect. VI-2, 
when we include the possibility of a Majorana nature of neutrino mass, lepton mix- 
ing has a form very similar to quark mixing, 


U = V®P,, (4.21) 


where V™ has the same mathematical content as the quark mixing matrix V of 
Eq. (4.17) except that the mixing angles {6;;} and phase ô now pertain to the neu- 
trino sector and 


1 0 0 
P,= {0 em/2 0 . (4.22) 
0 0 e'%2/2 


Here, the {œ;} are so-called Majorana phases. They are physical, i.e., observable, 
if the Majorana neutrino option is chosen by Nature. Although not contributing to 
neutrino oscillations, they will occur in the neutrinoless double beta decay 
(cf. Sect. VI-5) of certain nuclei. 

Thus far, information about the lepton mixing matrix has come from fits to neu- 
trino oscillation data (although highly anticipated searches for neutrinoless double 
beta decay are underway, cf. Sect. VI-5). There is no evidence at this time for CP 
violation in the lepton sector, so one cannot yet distinguish between the Dirac and 
Majorana cases described above. For either, the leptonic mixing angles are mea- 
sured to be [RPP 12]! 


sin? (20,2) = 0.857 + 0.024, 
sin? (243) > 0.95 (at 90%C.L.), 
sin?(2013) = 0.098 + 0.013, (4.23) 


which translates into angles (here we take the value for 623 from Table 13.7 of 
[RPP 12]) 


O12 = (33.941.0)°, 3 = 40.4445)", 65 = (9.1 + 0.6)°. (4.24) 


These values are quite different from the quark mixing angles inferred from 
Eq. (4.20b). Whereas the quark mixing matrix is a ‘zeroth-order’ unit 3 x 3 matrix 
modified by perturbative entries proportional to powers of the smallness parameter 
A, current data for lepton mixing are consistent with the ‘zeroth-order’ representa- 
tion!* 


13 See also [FoTV 12] and [FoLMMPR 12]. 
14 This form, referred to as tri-bimaximal mixing [HaPS 02], is used here as simply a numerical convenience. 
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2 a 0 
V6 V3 
vw=|-+ z= 4 (4.25) 
DERE 
V6 B VA 
Perturbative modifications of this can be introduced as 
1 1 
sin 013 = = sin 012 = uae sin 023 = oe (4.26) 


Jt V3 ao 


where the r, s, a parameters are sensitive, in part, to reactor, solar, and atmospheric 
data, yielding V? —> V), where [KiL 13] 


— (1+s+a-re’) =- (145404 Le) Ca 
V2 
(4.27) 


In the above ô is the phase parameter which reflects the possibility of CP violation 
in the lepton sector but for which there is, as of yet, no evidence. For the others, 
the current limits 


r= 0.22 + 0.01; s = —0.03 + 0.03, a = 0.10 + 0.05, (4.28) 


imply that we are not very far from this tri-bimaximal form. 


Quark CP violation and rephasing invariants 


There is no unique parameterization for three-generation quark mixing. Any 
scheme which is convenient to the situation at hand may be employed as long as 
it is used consistently and adheres to the underlying principles. There is, however, 
a somewhat different logical position to adopt, that of working solely with rephas- 
ing invariants. After all, only those functions of V which are invariant under the 
rephasing operation in Eq. (4.18) can be observable. An obvious set of quadratic 
invariants are the squared moduli AY = AK where i, j = 1, 2,3. The unitarity 
conditions V'V = VV‘ = I constrain the number of independent squared-moduli 
to four. They are of course all real-valued. In addition there are quartic functions 
AY = Vij VaVi Vij? where we suspend the summation convention for repeated 
indices and, to avoid redundant factors of squared-moduli, take a,i,k (b, j, D) 
cyclic. There are yet higher-order invariants, but they are all expressible in terms of 
the quadratic and quartic functions. The nine quantities AY are generally 
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complex-valued. A unique measure of CP violations for three generations is pro- 
vided by the rephasing invariant Ima, 


Im [Vij Vir Vit Vig] = JD Gikm i (4.29) 


m,n 


where J is the so-called Jarlskog invariant [Ja 85], 
J= €12€73€238 1281382385 = Ai + OG.) = (2.96+0:30) X io~; (4.30) 


This combination of quark mixing parameters will always appear in calculations 
of CP-violating phenomena. To have nonzero CP violating effects, the KM angles 
must avoid the values 6;; = 0, 7/2, and ô = 0, x. The CP-violating invariant J 
achieves its maximum value for c13 = 2/ v3, C12 = C3 = 1/2, Ss = 1 at which 
it equals 1/643. This set of circumstances is very unlike the real-world value in 
Eq. (4.30). 

The consideration of rephasing invariants need not involve just the mixing matrix 
V, but can also be applied to the Q = 2/3, —1/3 nondiagonal mass matrices 
m/,, m, themselves. In particular, the determinant of their commutator is found to 
provide an invariant measure of CP violations [Ja 85]. If, for simplicity, we work 
in a basis where m,’, m/, are hermitian, it can be shown that Ss; = See = S44, 
Thus we have 


[m’,, m/,] = S“ [m,, Vm; VÝ] S" = S“V [V'm,V, mg] VIS, (4.31) 


u?’ 


from which it follows that 


det [m’,, m/,] = det [m,, Vm; VÝ] = det [V'm,V, my]. (4.32) 


u?’ 


The two commutators on the right-hand sides of this relation are skew-hermitian 
and each of their matrix elements is multiplied by a Q = 2/3, —1/3 quark mass 
difference, respectively. The determinant is thus proportional to the product of all 
Q = 2/3, —1/3 quark mass differences, and explicit evaluation reveals 


det [m/,, mi] = 2i IMA® | |0nu a — mup)(Mae — ma.p). (4.33) 


a>B 


This provides a more extensive list of necessary conditions for CP violations to be 
present. Not only are the mixing angles constrained as discussed above, but also 
the quark masses within a given charge sector must not exhibit degeneracies. 

Our discussion of the n = 2, 3 generation cases suggests how larger systems n = 
4,... can be addressed, although the number of parameters becomes formidable, 
e.g., four generations require six mixing angles and three complex phases. How- 
ever, existing data indicate the existence of just three fermion generations, e.g., 
measurements from Z°-decay fail to see additional neutrinos, and there is no 
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evidence from e*e~ or pp collisions for additional quarks or leptons lighter than 
roughly 50 GeV. Moreover, if there were more than three quark generations the 
full quark-mixing matrix would be unitary, but one would expect to see violations 
of unitarity in any submatrix. Yet to the present level of sensitivity, the 3 x 3 KM 
mixing matrix obeys the unitarity constraint. The most accurate data occur in the 
(VV) = 1 sector. Here, the contribution from Va» is negligible and one finds 
[Ma 11, HaT 10] 


(VIV)i1 = |Vual” + [Vas + Vul? = 0.999916). (4.34) 
Problems 
(1) SU(3) 
(a) Starting from the general form AF, Ag; = Aôijôkı + Bôiô jk + Côikô jı (a = 
1,...,8 is summed), determine A, B, C by using the trace relations of 


Eqs. (II-2.5a, 2.5b), etc. 

(b) Determine TrA@A?A°. 

(c) Determine CijkElmn Ae pAb Agi Ay. 

(d) Consider the 8 x 8 matrices (Fa)be = —ifabe, where the {fabe} are SU (3) 
structure constants (a, b,c = 1, ..., 8). Show that these matrices (the regu- 
lar or adjoint representation) obey the Lie algebra of SU (3), and determine 
Tr Fa Fp. 

(2) Gauge invariance and the QCD interaction vertices 

(a) Define constants f, g such that the covariant derivative of quark q is Daq = 
(0. + if A,„)q and the QCD gauge transformations are A, > U A „UT! + 
ig-'Ud,U~! and q —> Uq, where A, are the gauge fields in matrix form 
(cf. Eq. I-5.15)). Show that 4 P q is invariant under a gauge transformation 
only if f = g. 

(b) Define a constant h such that the QCD field strength is Fay = 0,A, — 
d,A,+ih[A,, A,]. Let the gauge transformation for A, be as in (a). Show 
that Fu» transforms as Fay > UF ae only ifh = g. 

(3) Fermion self-energy in QED and QCD 

(a) Express the fermion QED self-energy, —iX(p), of Fig. II-2(b) as a 
Feynman integral and use dimensional regularization to verify the forms 
of ZS), 5m™) appearing in Eqs. (1.34), (1.35). 

(b) Proceed analogously to determine Z for QCD and thus verify Eq. (2.55). 

(4) Gravity as a gauge theory 
The only force which remains outside of the present Standard Model is gravity. 
General relativity is also a gauge theory, being invariant under local-coordinate 
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transformations. The full theory is too complex for presentation here, but we 
can study the weak field limit. In general relativity the metric tensor becomes 
a function of spacetime, with the weak field limit being an expansion around 
flat space, g” (x) = g” + h” (x), with 1 >> h”. Let us consider weak field 
gravity coupled to a scalar field, with lagrangian L = Lgrav + Lmatter defined as 


1 = 7 = 
Leray = ——— [dahu ð h” — 20*h, doh"? | , 
g 641 Gy [ Atu uà ] 
1 1 f j 
L matter = 2 (1 = zm) [e at h” )ðLp ap = mo" | ` 
where h” = h” — g””hł}/2, all indices are raised and lowered with the flat 


space metric g””, and Gy is the Cavendish constant. 

(a) Show that the action is invariant under the action of an infinitesimal coor- 
dinate translation, x“ —> x’ = x" + e€"(x) (1 > e“(x)), together with a 
gauge change on h"”, 


g(x) > g'a) = px), 
h” (x) > h (x) = ht” (x) + 3e” (x) + OP EM (x). 


Note: both €” and h”” are infinitesimal and should be treated to first order 

only. 

Obtain the equations of motion for g and h””. The source term for h“” is 

T””, the energy-momentum tensor for g. Use the equation of motion for 

h”” to show that 7” is conserved. Simplify the equations with the choice 

of ‘harmonic gauge’, d,h“” = 0. 

(c) Solve for h“” near a point mass at rest, corresponding to T°? = M8® (x) 
and T”! = TË = 0. Performa nonrelativistic reduction for g, i.e., p(x, t) = 
e~'™' G(x, t), in order to obtain a Schrödinger equation for @ in the gravita- 
tional field. Verify that Newtonian gravity is reproduced. 


(b 


wm 
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Symmetries and anomalies 


Application of the concept of symmetry leads to some of the most powerful tech- 
niques in particle physics. The most familiar example is the use of gauge symmetry 
to generate the lagrangian of the Standard Model. Symmetry methods are also valu- 
able in extracting and organizing the physical predictions of the Standard Model. 
Very often when dealing with hadronic physics, perturbation theory is not applic- 
able to the calculation of quantities of physical interest. One turns in these cases 
to symmetries and approximate symmetries. It is impressive how successful these 
methods have been. Moreover, even if one could solve the theory exactly, symme- 
try considerations would still be needed to organize the results and to make them 
comprehensible. The identification of symmetries and near symmetries has been 
considered in Chap. I. This chapter is devoted to their further study, both in general 
and as applied to the Standard Model, with the intent of providing the foundation 
for later applications. 


IfI-1 Symmetries of the Standard Model 


The treatment of symmetry in Sects. I-4, I-6 was carried out primarily in a general 
context. In practice, however, we are most interested in the symmetries relevant 
to the Standard Model. Let us briefly list these, reserving for some a much more 
detailed study in later sections. 

Gauge symmetries: As discussed in Chap. II, these are the SU (3). x SU(2); x 
U(1)y gauge invariances. It is interesting to compare their differing realizations. 
SU (3), is unbroken but evidently confined, whereas SU (2); x U(1)y undergoes 
spontaneous symmetry breaking, induced by the Higgs fields, leaving an unbroken 
U (Lem gauge invariance. 

Fermion-number symmetries: There exist global vector symmetries correspond- 
ing to both lepton and quark number. These are of the form 
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Wa > eee ipa (1.1) 


for fields of each chirality. The index a refers to either the set of all leptons or the 
set of all quarks, and the conserved charges Q, are just the total number of quarks 
minus antiquarks and the total number of leptons minus antileptons.! Conservation 
of baryon number B is violated due to an anomaly in the electroweak sector, but 
B — L remains exact. 

Global vectorial symmetries of QCD: If the quarks were all massless, there 
would be a very high degree of symmetry associated with QCD. Even if m Æ 0, 
symmetries are possible if two or more quark masses are equal. Three of the quarks 
(c, b, t) are heavy compared to the confinement scale Agcp and widely spaced in 
mass, so they cannot be accommodated into a global symmetry scheme.” However, 
the u, d, and s quarks are light enough that their associated symmetries are useful. 
The best of these is the isospin invariance, which consists of field transformations 


y= (‘) > y' =exp(—it - 0) (1.2) 


where {t'} (i = 1, 2,3) are SU (2) Pauli matrices and {6‘} are the components of 
an arbitrary constant vector. Associated with the SU (2)-flavor invariance are the 
three Noether currents 


i 


JË = $y 13 
Isospin symmetry is broken by the up—down mass difference, 


My 


Luss E Giu 4 dd) — a i ~ dd) (1.4) 
and by electromagnetic and weak interactions. Inclusion of the strange quark 
extends isospin to SU (3)-flavor transformations 


u 
y = | d | > W =exp(-i0 -A)y, (1.5) 
s 


where {åf} (a = 1,2, ..., 8) are the SU (3) Gell-Mann matrices. The SU (3)-flavor 
symmetry is broken significantly by the strange quark mass, and to a lesser extent 
by other effects. Predictions of isospin symmetry work at the 1% level, whereas 
SU (3) predictions hold only to about 30%. It is occasionally convenient to employ 


1 In Chap. VI, we return to the study of lepton-number violation through possible Majorana mass terms 
2 See, however, the discussion of the dynamical heavy-quark symmetries in Chap. XII-3 
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a particular SU (2) subgroup of SU (3), called U-spin, which corresponds to the 


transformations 
d d 
( ) — exp(-iT - ( | (1.6) 
KY sS 


U spin is also a symmetry of the electromagnetic interaction, since its generators 
commute with the electric-charge operator. The U -spin symmetry is broken by the 
large d-quark, s-quark mass difference. 

Approximate chiral symmetries of QCD: The vectorial symmetries are valid if 
quark masses are equal. If the masses vanish, there are additional chiral sym- 
metries, because in this limit the left-handed and right-handed components of the 
fields are decoupled (cf. Sect. I-3), 


1 = 7 
Loco) = =a + VPY + VRP vr, (1.7) 
m=0 
i.e., the left-handed and right-handed fields have separate invariances. For massless 
up-and-down chiral quarks, the symmetry operations are 


Wi > exp(—i0,-t)W, = LYL, Wr— exp(—iOe-t)Wr=Ryer_ (1.8) 


where wW pr are chiral projections of the y doublet in Eq. (1.2). These can also be 
expressed as vector and axial-vector isospin transformations, 


y > exp(—i0y - t)y, Y > exp(—i04 : Tys)W (1.9) 


with Oy = (0; + Or)/2, and 04 = (0L — 0pr)/2. This invariance is variously 
referred to as chiral-SU (2), SU(2); x SU(2)p, or SU(2)y x SU(2),4. In OCD, it 
is broken by quark mass terms, 


Lmass = M, itu — madd = —m,(üLur + ūruL) — mal(dıdr +drdz). (1.10) 


Thus, if m, = m4 40, separate left-handed and right-handed invariances no longer 
exist, but rather only the vector isospin symmetry. The generalization to three mass- 
less quarks defines chiral SU (3) (or SU(3), x SU (3)r) and is a straightforward 
extension of the above ideas. 

Discrete symmetries: Since the Standard Model is a hermitian and Lorentz- 
invariant local quantum field theory, it is invariant under the combined set of 
transformations CPT. Both QCD (given the absence of the 0-term) and QED con- 
serve P, C, and T separately. By contrast, the electroweak interactions have max- 
imal violation of P and C in the charged-current sector. If a nonzero phase resides 
in the quark-mixing matrix, there will exist a breaking of CP, or equivalently of 7, 
invariance. Otherwise the weak interactions are invariant under the product CP. 
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In addition to the above exact or approximate symmetries of the Standard Model, 
there are some important ‘non-symmetries’ of QCD. By these we mean invariances 
of the underlying lagrangian, which might naively be expected to appear as sym- 
metries of Nature but which, for a variety of reasons, do not. These include the 
following. 

Axial U(1): The QCD lagrangian would have an axial U(1) invariance of the 
form 


y= |d| ow =e "Sy, (1.11) 


S 


if the u, d,s quarks were massless. However, this turns out not to be even an 
approximately valid symmetry, as it has an anomaly. We shall return to this point 
in Sect. HI-3. 

Scale transformations: If quarks were massless, the QCD lagrangian would con- 
tain no dimensional parameters. The lagrangian would therefore be invariant under 
the scale transformations 


W(x) > BPwAx), AS (x) > AAT (AX), (1.12) 


where w and A‘ are respectively the quark and gluon fields. This invariance is also 
destroyed by anomalies (see Sect. III-4). 

‘Flavor symmetry’: Because the gluon couplings are independent of the quark 
flavor, one often finds reference in the literature to a flavor symmetry of QCD. 
Unless the specific application is reducible to one of the above true symmetries, 
one should not be misled into thinking that such a symmetry exists. For example, 
flavor symmetry is often used in this context to relate properties of the pseudoscalar 
mesons 7(549) and n’ (960) (or analogous particles in other nonets). However, the 
result is rarely a symmetry prediction. Rather, this approach typically pertains to 
specific assumptions about the way quarks behave, and is dressed up by incorrectly 
being called a symmetry. In group theoretic language, this may arise by assuming 
that QCD has a U (3) symmetry rather than just that of SU (3). 


IHI-2 Path integrals and symmetries 


The transition from classical physics to quantum physics is in many ways most 
transparent in the path-integral formalism. In this chapter we use these techniques 
to provide a quantum description of symmetries, complementing the treatment at 
the classical level of Sects. I-4, I-6. A brief pedagogical introduction to those 
path-integral techniques which are important for the Standard Model is provided 
in App. A. 
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The generating functional 


In order to implement a quantum description of currents and current matrix ele- 
ments, one studies the generating functional, Z, of the theory. For a generic field 
gy, we have 


Zfl=e™l = [ico expi f d'x (Lp, 99) — jp), (2.1) 


where j (x) is an arbitrary classical source field whose presence allows us to probe 
the theory by studying its response to the source. The symbol [dø] indicates that 
at each point of spacetime one integrates over all possible values of the field g(x). 
All the matrix elements needed to describe physical processes in the theory can be 
obtained from In Z[j] by functional derivations, i.e., 


8" In Z[j] 
57 Gu). 87 ip) py 


(0|T (ex)... e(xp))| 0) = O” (2.2) 
where n is the number of fields in the matrix element. If there is more than one 
field, i.e., the set {g;}, a separate source is introduced for each field. 

If one wants to study a given current J” (not to be confused with the source j) 
associated with some classical symmetry, one simply adds an extra classical source 
field v,, which is coupled to that current, 


Zl = [ie expi f atx (L- jp — v, J”). (2.3) 
In this case all matrix elements involving J” can be obtained by functional deriva- 


tion with respect to v,, 


= .ôlnZ 
JP (Z) =i 
ôv (x) 


(2.4) 


v =0 
where the bar in J” indicates that it is a functional describing matrix elements of 
the current J”. Specific matrix elements are obtained by further derivatives, as in 
2 


ô = 
oie ye 0) =i 2 A 2.5 
OTE S E a j=0 = 


This device allows one to discuss all possible matrix elements of the current J“. 
As an example, consider the vector and axial-vector currents of QED. We define 


Z[vu Oy] = J [ayida A oma a (2.6) 


III-2 Path integrals and symmetries 81 


A three-current (connected) matrix element is obtained then as 


Trap (Xs Y, Zeonn = (0 |T (Yy ys wE) VO) Yah (y) WZ) ypv(z))| 0) 


2 
= (i)? | 2 : In z 
ôv (y)ôvf (z) ĉar (x) t= 


at= 


oo 


2 


3 , 
a. a O 
dv%(y)dvF (z) oa oe 


= (i) 


where the axial-vector quantity Jsu is defined in analogy with Eq. (2.4). 


Noether’s theorem and path integrals 


Returning to the general case, let us consider an infinitesimal transformation of a 
set of fields {g;} (cf. Eq. A-3.1)) 


Gi > P; = pi + E(x) fi (p) (2.8) 


such that the current under discussion is 


ƏL’ 
J= : (2.9) 
0(0,€) 
If this is a symmetry transformation, one has up to a total derivative, 
L' = L (¢', 09’) = L (Q, 09) + J” ae. (2.10) 


If e(x) is a constant, the lagrangian is invariant under the transformation. This is the 
statement of the classical symmetry condition. In order to study the consequences 
of this situation, we rewrite our previous definition of the current matrix elements 


Jx) =i 


5 
ae In Z[vy] (2.11) 


in integral form by noting 
ôln Z[v,] = In Z[v, + ôv] — In Z[v,] = —i l dx J (x)v(x), (2.12) 
which is just the inverse of Eq. (2.11). Now choosing the particular form for ôv,, 
dv, (x) = —d,€(), (2.13) 
we have 
ôe In Zl v,] = In Zu, — 0,€] — In Z[v,] 


= if ax J” (x)ð E(x) = if dreo (2.14) 


82 Symmetries and anomalies 


With this procedure we can isolate a divergence condition for J“. If Z [vy — ðE] = 
Z[v,], then ð, J” (x) = 0. To check this, consider 


Z[v, — ð El] = fidone: | d‘x (Li, dpi) — (Va — d,€) J") : (2.15) 


If we can change integration variables so that 


J idol = / [dg] (2.16) 


with g; given by Eq. (2.8), then we obtain 


Z[v„, — due] = [iden expi [as (Lloi, dpi) + vaJ”) = Ziv], 217 
and therefore 
ðJ" C=O, (2.18) 


This change of variables seems reasonable and in most cases is perfectly legitimate. 
After all, the symbol [dg; (x)] means that we integrate over all values of the field 9; 
separately at each point in spacetime. Shifting the origin of integration at point x by 
a constant, g; (x) = g}(x) —e(x) fi, and then integrating over all values of g; should 
amount to the original integration. Given this shift, we have obtained in Eq. (2.18) 
by Noether’s theorem a quantum conservation law involving matrix elements. The 
expression 0,, J“ (x) = 0 means that all matrix elements of J”, obtained via further 
functional derivatives (as in Eq. (2.5)), satisfy a divergenceless condition, i.e., of 
the current J“ is conserved in all matrix elements. 

It was Fujikawa who first pointed out the consequences if the change of vari- 
ables, Eq. (2.16), is not a valid operation in a path integral [Fu 79]. Certainly, 
many procedures involving path integrals need to be examined carefully in order 
to see if they are well defined. We shall explicitly study some examples in which 
the change of variable is nontrivial and can be calculated. In such cases one finds 
On J"(x) 4 0, which implies that the classical symmetry is not a quantum symme- 
try. In these situations it is said that there exists an anomaly. 


MI-3 The U(1) axial anomaly 


For massless quarks m, = mg = m; = 0, the QCD lagrangian contains an invari- 
ance Locp —> Loecp under the global U (1) axial transformations 


y= |d| > y =e My. (3.1) 
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In this limit, which we shall adopt until near the end of this chapter, Noether’s 
theorem can be applied to identify the classically conserved axial current, 


Je) = iiyywysu + dypysd +5ypyss, JẸ =0, (3.2) 


where the superscript on hs denotes an SU (3) singlet current. We shall see that 
this is not an approximate symmetry of the full quantum theory because the current 
divergence has an anomaly. This can be demonstrated in various ways. For a direct 
‘hands-on’ demonstration, the early discussion [Ad 69, BeJ 67, Ad 70] of Adler 
and of Bell and Jackiw, which we recount below, has still not been improved upon. 
However, for a deeper understanding, Fujikawa’s path-integral treatment [Fu 79], 
also described below, seems to us to be the most illuminating. The effect of an 
anomaly is simply stated, although one must go through some subtle calculations 
to be convinced that the effect is inescapable. An anomaly is said to occur when a 
symmetry of the classical action is not a true symmetry of the full quantum theory. 
The Noether current is no longer divergenceless, but receives a contribution arising 
from quantum corrections. It is this contribution which is often loosely referred 
to as the anomaly. The Ward identities which relate matrix elements no longer 
hold, but rather are replaced by a set of anomalous Ward identities, which take into 
account the correct current divergence. 

There are two applications of the axial anomaly which have proved to be of 
particular importance to the Standard Model. One is in connection with the SU (3) 
singlet axial current described above. Here the anomaly will end up telling us that 
the current is not conserved in the chiral limit, but rather that 


3a ~ ~ 1 
(0) S pa auv a vaf pa 
el = m p (fs. = 56" Fes) . (3.3) 


This will serve to keep the ninth pseudoscalar meson, the n’, from being a pseudo- 
Goldstone boson. 

The other application is in the decay m? —> yy, which is historically the process 
wherein the anomaly was discovered. The quantity of interest here is an isovector 
axial current I which transforms as the third component of an SU (3)-flavor 
octet, 


Ie = UY Ysu — dy, ysd. (3.4) 
Without the anomaly, one would expect that the current i would be conserved 


in the chiral SU (2) limit even in the presence of electromagnetism. This follows 
from the apparently correct procedure 
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at ds) =ü |(P—iOA) rs -rs +i OA)| u 
—d|(9—iOA) ys — ys@+iOA)|d =0. 


However, explicit calculation shows that the current has an anomaly, such that 


(3.5) 


aN. Z 

Fiuy Fk’, 3.6 
6x " ee 
where F,» is the electromagnetic field strength. This will be important in predicting 
the 7° — yy and n° — yy rates and serves as a test for the number of quark 
colors. 


ca = 2i (myitysu — madysd) + 


Diagrammatic analysis 
To review the work of Adler and of Bell and Jackiw, we first consider the Ward 


identities for the coupling of the U (1) axial current to two gluons. We define 


T2 (kq) =i J dx d'y eten for (IRR) o), 6D 


where J$ is a flavor-singlet (color-octet) vector current coupled to gluons 
=y g A (3.8) 
A ae as : 


It is important to understand that the SU(3) matrices pertain here to the color 
degree of freedom and should not be confused with analogous matrices which 
operate in flavor space. The amplitude ihe is related to the vacuum-to-digluon 
matrix element by 


(G 1, q) Ga, —k — q)|JO|0) = igzel "ef? 72. (kg). 6D 


There are two Ward identities, representing the conservation of axial and vector 
currents. The vector Ward identity, corresponding to color current conservation, 
a* J? = 0, is 


gta =0. (3.10) 


The axial Ward identity is derived in a similar fashion using the assumed conser- 
vation of the U(1) axial current in the massless limit, 


PJ. C= 0, (3.11) 
to yield 
kU Tf? (k, q) = 0. (3.12) 
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Fig. UI-1 Triangle diagrams associated with the axial anomaly. 


In order to reveal the anomalous behavior of this coupling, we calculate the 
vertex in lowest-order perturbation theory via the triangle diagrams of Fig. III-1. 
With the momenta as labeled in the figures, this produces the amplitude 


rib a dtp ” 1 a 1 às 1 
= — — T ‘on 
uap Onl As 2 pag" aa 


1 ae 1 | Ji 
E Eyr" ay) | P 


+ Tr g y5 


where the prefactor of 3 arises from the three massless quarks, each of which con- 
tributes equally. 

Observe that these integrals are linearly divergent, and so may not be well defined. 
In particular, there exists an ambiguity corresponding to the different possible ways 
to label the loop momentum. An example will prove instructive, so we consider the 
integral 


— 4 Py _ (p—*)y 
n= f a'r |B ol: pal 


This is evaluated by transforming to Euclidean space, where po = ips and p° = 
-p4 — p? = — p}. In order to perform the integration, one may note that for a 
general function, F(p), whose four-dimensional integral is linearly divergent (i.e., 
one with p°>F(p) Æ 0, but p? F'(p) = pF" (p) =... = 0 for p —> oo), one finds 
by Taylor expanding and using Gauss’ theorem that 


4 = _ — 4 H = Sy 
d'pelF(p)— F(p- 1) = | ape | 0, F(p) gh © OOF (p) +++ 


H 3 to 
=f fès P= 0, F(p)+-:-: 


p>oo 


= e fas, F(p) (3.15) 


poo 
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where d°S,, indicates integration over a three-dimensional surface at p —> ov.3 
Applying this result to the case at hand, we obtain a surface integral 


, Pp (p-£) . p : Pu P 
I, =i | d (2 - eo) =i [a's a= =iet f aisPePe. 
k | PE\ pi (p-o " pt p p’ 


(3.16) 


Note that from euclidean covariance we can replace p, py by uy p/4, to yield 


2 
lL, = = [ess = o (3.17) 
where the last step uses the surface area of a three-dimensional surface in four- 
dimensional euclidean space, S4 = 277 R°. 
In the case of T g» consider the effect of shifting the integration variable of the 
first term in Eq. (3.13) no p to p + big + bo(—k — q). In order to maintain 
the Bose symmetry of T, a (i.e., symmetry under the interchange a <> £ at the 
same time as q <> (—k — q)) we must shift the second integration from p to 


p + bı(—k — q) + bog. Use of Eqs. (3.14)-(3.17) then yields the change in Ta 


160? 
AT ius = aya nebr ll” (big + ba — K) — I” iq — k) + bog) 
3500 
= —Téq2 a bz) €papy (24 F KY” (3.18) 


induced by the shift of the original integration variable p”. This is an indication 
that there may be trouble in the calculation of this diagram, but it is not yet proof 
of any violation of the Ward identities. 

Let us now check the Ward identities. In both cases, use can be made of identities 
similar to q% = p* — (p% — q”) in order to change the result into a difference of 
integrals. We find for the vector Ward identity 


Tg) 

_ 30" f dp "| J 

== a nng” ma d -ungrar p 
g Se (p+ k+q)’p J 

E E a i a9) 

(pt+k)*(p-q) (pt+k+q)*p’ 


= —6i Pe goo 
l Eupp Ta 


3 Note that this is just the four-dimensional generalization of the one-dimensional formula 


me oP 1 1 2 
Í dxtfe+y)-f@I= Í dx [s SEROR =] = y[f (co) — f(—00)], 
= —00 


valid for f (+00) 4 0 but f' (+00) = f”(to00) =... = 0. 
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while for the axial-vector case, 
35%” d'p 1 


1 1 
k“ TA (kg) = 7 Qn c Degman r F J” 


1 
era A a rE” 


zje +tk+a pP (p++ -o 
(p+k+4PpP  (p+k?(p-4q? 
(p+ (ptk+4Y (p- | 
(p+k)(p+k+4} (p-4}pP] 


= —6i ô” Euppo 


(3.20) 


It is easy to see that if one could freely shift the integration variable, each expres- 
sion would separately vanish. However, direct calculation using Eqs. (3.14)-(3.17) 
yields 
ab a ab 
T= ~ Euppok?q? and kT% uap (Ks) = gaa Epek q° 
(3.21) 


ete a= 


If, on the other hand, the original integration variable were shifted as in Eq. (3.18) 
one would obtain 
ab 


167? 
ab 


~ 
KPT ee (Kk, q) = =— (1 — by + b2) €appo k” q7 


Tr (kg) = — (1 + bi — bz) Eugpo kq, 


(3.22) 


Thus, either one of the original Ward identities may be regained by a particular 
choice of bı — bz, but both expressions cannot vanish simultaneously. 

Our discussion of the manipulations of Feynman diagrams should not obscure 
the main physical fact illustrated above, i.e., despite the claim of Noether’s theo- 
rem that there are two sets of conserved currents (vector SU (3) of color and axial- 
vector U (1)), one-loop calculations indicate that only one can in fact be conserved. 
On physical grounds, we know that in Nature the vector current is conserved, as 
its charge corresponds to QCD color charge. Thus, it must be the axial current 
which is not conserved. This phenomenon is at first sight quite surprising and it 
deserves the name ‘anomaly’ by which it has come to be called. Noether’s theo- 
rem has misled us, and it is only by direct calculation of the quantum corrections 
that the true symmetry structure of the theory has been exposed. Note that the sit- 
uation is not the same as spontaneous symmetry breaking, where the symmetry 
is hidden by dynamical effects. There the currents remain conserved, as demon- 
strated in Sect. I-6. Here, current conservation has been violated. In particular, 
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the calculation described above (with bı — b2 = 1) is consistent through use of 
Eq. (3.9) with the operator relation of Eq. (3.3), 


30, o 
Os pa paw, (3.23) 


mw yO) _ 
ð su as 4r YY 


Both sides of this equation have the same two-gluon matrix elements. It is clear 
from this that the apparent U(1) symmetry predicted by Noether’s theorem is not 
a symmetry of the quantum theory after all. 


Path-integral analysis 


In a path-integral treatment [Fu 79], the symmetry of the theory can be tested by 
considering the generating functional, as described in Sect. HI-2. In particular, if 
we consider a functional of the gluon field A? and an axial current source a,,, 


Z [a,, AS] = J [dw]ldW]expi i d*x (Loco J, AS) — a, go”) (3.24) 
then the steps leading to Eq. (2.14) produce 
—i J d*x BI IQ) (x) = In Z [ay — 9,8, AŻ] — In Z [au AP], (38.25) 


where i (x) denotes the matrix elements of the current J. o 


70) = b 
aR CER (3.26) 
In particular, the two-gluon matrix described above is given by 
TŒ, y, z) = (D? — Ë JL (x) | (3.27) 
5Ax(y)8 A} (2) Axo 


In order to solve for lar i , we note that the „£ term can be absorbed into a 


redefinition of the fermion fields. This can be seen from the identity (for infinitesi- 
mal £), 


Vial + IB wy" ys =v (A — iBys)ip (1 —ipys)w. (3.28) 
The following quantities are invariant under this transformation: 


VA My = YA — iby AAA — iBys)y, 


ae | (3.29) 
Ju = Wb = YA — ibys)y 0 — iBys)y. 
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Mass terms would not be invariant, but we are presently working in the massless 
limit. Therefore, if we define 


y'=- ißbys)y = ep + OB’, 
Y'= Wl — iBys) = pe" + OB, 

we see that the lagrangian can be written in terms of y’, 
Loco. Y, AS) + 3B IS) = Loco(y', Y', AS). (3.31) 


Furthermore, we would like to change from y to yy’ in the path integration. To be 
general, we allow for the possibility of a jacobian 7 accompanying this change of 
variables, viz., 


(3.30) 


/ [dvd = l CEARA (3.32) 


If, as will be shown later, the jacobian J is independent of y and Y, it can be taken 
to the outside of the path integral, resulting in 


Z [ay — 9,8, Ai] = l [dw dp] g e S Ox Coco Apa5) 


(3:33) 
=JZ [au Ai] : 
Thus, the test for the symmetry, Eq. (3.25), depends entirely on J, 
Ihng = -i J d*x B(x)a" Js) (x). (3.34) 


The lesson learned is that if the lagrangian and the path-integral measure are invari- 
ant under the U (1) transformation, then there exists a U(1) symmetry in the theory, 
with ae = 0. However, if the lagrangian is invariant, as it is in this case, but the 
path integral is not (i.e. J # 1), then the U(1) transformation is not a symmetry 
of the theory, i.e., aH Ig? #0. 

We shall show below that the jacobian, when properly regularized, has the form 


Os 


3 
4 


- z] ; (3.35) 


J = exp (—2i tr Bys) = exp l- J dx B(x) 


so that the current divergence has the form given in Eq. (3.3), 


= 3a Š 
(COSS S pa apv 
Ə" Jó = ae For 
Functional differentiation using Eq. (3.27) yields the same result for q” Ty as 


obtained in ordinary perturbation theory. The nontrivial transformation of the 
path-integral measure has prevented the axial U (1) transformation from being a 
symmetry of the theory. We now turn to the calculation of the jacobian. 
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The jacobian in fact diverges, and a regularization is needed in order to make 
it finite. In Fujikawa’s original calculation the regularizer was introduced early 
into the procedure, allowing each step to be well defined. We will be slightly less 
rigorous by introducing the regularizer somewhat later. In order to calculate the 
jacobian we need to review the properties of integration over Grassmann numbers 
(which are described in more detail in App. A-5). The anticommuting nature of the 
variables requires that any function constructed from them terminates after linear 
order in each variable. Thus, a function of two Grassman numbers z1, Z2 (z}Z2 = 
2221; a = 0) becomes 


Sf (1, 22) = fot fizi + foz2 + f122122, (3.36) 


where fo, fi, f2, fiz are real numbers. The primary property of an integral to be 
transferred to Grassmann numbers is completeness, i.e., 


fe fac — fe f(z+2z), (3.37) 


where z’ is a constant Grassmann number. Expanding both sides we have 


fae Got fan = faz (fo fic + fz) 3.38) 
For this to be true, the condition 
/ dz=0 (3.39) 
is required. Now consider a change of variables 
Zi = C1171 + C122), Z2 = C2171 + C222), (3.40) 
involving a matrix of coefficients C. The jacobian is defined by 
J iad fey= I fazi dz, f (Cz’). (3.41) 
Application of Eq. (3.36) leads to the consideration of only the f2 term, 
fiz / dzı dz2 z122 = J f2 l dzi dz% (cuz, + €1225) (C2124 + C2224) 
= J fi2(e11022 — €12€21) J dz, dz, 2425, (3.42) 


and hence the identification of the jacobian, 


J = [det C]! . (3.43) 
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Although derived in the simple 2 x2 case, Eq. (3.43) generalizes to arbitrary dimen- 
sion. Note that, due to the Grassmann nature of the variables, this result is the 
inverse of what would be expected with normal commuting variables. 

Turning now to the path integral, we temporarily consider w(x) as a finite num- 
ber of Grassmann variables corresponding to four Dirac indices at each point of 
spacetime (i.e., imagine that the spacetime label is discrete and finite). At each 
point, the transformation is from y > y’ 


o (x) = ehy (x), U(x) = y'a) FO”, (3.44) 


so that the overall jacobian has the form 
J = [det (c9%5)]~* [det (ec!) ]! (3.45) 


with one factor from each of the y and w variables. The determinant runs over the 
4 x 4 Dirac indices, the three flavors, colors, and also the spacetime indices. This 
is arather formal object, but can be made more explicit by using 


detC = et ne, (3.46) 
valid for finite matrices, to write 
J =e T, (3.47) 


The symbol tr denotes a trace acting over spacetime indices plus Dirac indices, 
flavors, and colors, 


tr Bys = Tr’ f ats (x|Bys|x), (3.48) 


with Tr’ indicating the Dirac, color, and flavor trace. This will become clearer 
through direct calculation below. 

The jacobian still is not regulated. Fujikawa suggested the removal of 
high-energy eigenmodes of the Dirac field in a gauge-invariant way. Consider, for 
example, the simple extension 


J= lim, exp |-2i tr (Bys daa ‘ (3.49) 


where M is the QCD covariant derivative. The insertion of a complete set of eigen- 
functions of P exponentially removes those with large eigenvalues. There has been 
an extensive literature demonstrating that other regularization methods produce the 
same results as Fujikawa’s, provided that the regulator preserves the vector gauge 
invariance. 
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In order to complete the calculation we employ the following identity: 


1 arwa 1 up” 
PP = 5u: Yv} D D + zin» yy|D D 


1 

= DD" + Fu. w1 [D", D] (3.50) 
830" v pa 

= D,D” + Fi Fin 


In this case the expression 
(x| exp —(@/M)°|x) (3.51) 


has the same form as given in Eqs. (B—1.1), (B—1.9), (B—1.17—18) with the identi- 
fications 


83 1 
dy = Di, o = qo N Fo T = M2 (3.52) 
Applying the calculation done there to our present situation yields 
J= lim e7 S dx Tr (B(x) ys H(x,M7?)) 
M->oo 
= iji ae J dx Tr (B(x) ys[M4a9+M7a\+a2+O(M~*)]) 3.99) 
M->oo 


The notation is defined in App. B-1. The first two traces vanish, leaving only the 
factor with two o“” matrices in a2. From the result 


Tr (yso o) = — Tr ysy” y” yy? = —4jch | (3.54) 


it is easy to calculate 


1 4 1 gata? vpra „æf pb 
I =op (25 fa x B(x) Tr (8 on Lo Te 


=1 4 ab . _uvaß 83 a pb 
=e (Ss Ja x B(x) 3-286 .4ie" 16 fH Pap (3.55) 
3a x 
= us dad’ 5 F° por” 
exp ( i | x B(x) rer ) 


where the trace Tr’ has produced factors for three flavors, color, and the Dirac 
trace. 

Although the calculation of the jacobian has been somewhat involved, we have 
succeeded in making sense out of what seemed to be a rather abstract object. The 
fact that it is not unity is an indication that the U (1) transformation is not a sym- 
metry of the theory. Applying Eq. (3.34) we see that 


Os pa aw (3.56) 


ng = “i f ax Bo I) = -i fats pot uv 
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or once again 


<i. “SOc 
ee Fe Pw, (3.57) 


The choice of a regulator which preserves the vector SU (3) gauge symmetry is 
important. Whereas in the Feynman diagram approach, we had the apparent free- 
dom to shift the integration variable to preserve either the vector or axial-vector 
symmetries, the corresponding freedom in the path-integral case is in the choice of 
regularization. 

If quark masses are included, the operator relation becomes 


30s a = 
atk. 358) 


Masses do not modify the coefficient of the anomaly, basically because it arises 
from the ultraviolet divergent parts of the theory, which are insensitive to masses. 

One does not have to go through these lengthy calculations for each new appli- 
cation of the anomaly. The anomalous coupling for currents 


VP = WT, AD = Puys TP Y, (3.59) 


where 7), T,® are matrices in the space of quark flavors, is of the form 


d IL (x) = 2i (m,ūysu + mad ysd + m,5ys5s) + 


De 
ava” = TE eE Fe Fag + mass terms, (3.60a) 
Ne 7: 
Di = oa (TOITA T), (3.60b) 


where Ne is the number of colors. In particular, for the electromagnetic coupling 
to the isovector axial current we have 


JS) = itypysu — dyyysd, 
N. (3.61) 
Dp = eN, Tr 30° = Be 
leading to the result already quoted in Eq. (3.6). 
The full content of the anomaly was given by Bardeen [Ba 69]. Consider a 
fermion with 7 internal degrees of freedom (flavor or color) coupled to vector and 
axial-vector currents Vy, d,,, 


L= Y ip — p — fys) Y. (3.62) 
These currents are in an 7 x 7 representation 


Vy = vit + via, ay = a l + ann. (3.63) 
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Thus, the axial current is i = WYy,yså* W, and the anomaly equation becomes 


1 1 1 
atJ” = mee Tr E (greves + Ay ap 


>H 4 12 
2i 2i 2i 8 
— 7 uA Vap — z Vuvlalg — y Ay vadp — Fyn) : 
Vuv = On Vv = Oy Up + i[Vp, vy] T ilap, a], 
apv = Juy — Day + i[vy,, ay] — i[v,, ay]. (3.64) 


This may also be expressed in terms of the left-handed and right-handed field 
tensors £,» and r,,, by using the identities, 


Luv = Only = Can + illn, L,)= Vuv a auv» 
Fw = Only = Chan + Tp r] = Vuv — Aus 
1 


1 v v 1 v v 
qn Pap + partes E (Cat tar” ) + a ar Fat ) . (3.65) 


In the language of Feynman diagrams, one encounters the anomaly contributions 
not only in the triangle diagram, but also in square and pentagon diagrams (e.g. 
from the a,,d,d,ag term). Our previous result, Eq. (3.57), is obtained for a, =0, 
Uy, = 83A% A* /2, with three flavors and three colors of quarks. 

We have seen that symmetries of the classical lagrangian are not always sym- 
metries of the full quantum theory. This is the general situation when there are 
anomalies. These appear in perturbation theory and are associated with divergent 
Feynman diagrams. This sometimes gives the mistaken impression that the dynam- 
ics has ‘broken’ the symmetry, and hence one might expect a massless particle 
through the application of Goldstone’s theorem. In the path-integral framework the 
impression is different. There the symmetry never exists in the first place, as the 
calculation performed above is simply the path-integral test for a symmetry, gener- 
alizing Noether’s theorem. Hence there is in general no expectation for a Goldstone 
boson. 

Can anomalies cause problems? When the anomaly occurs in a global symme- 
try, such as the above U(1) example, the answer is, ‘no’. They just need to be 
taken properly into account, e.g., as in Eq. (3.61). Given the specific form of the 
anomaly operator relation, there exist ‘anomalous Ward identities’ which contain 
terms attributable to the anomaly [Cr 78]. These anomalies can even be associated 
with a variety of specific phenomena. For example, in Sect. VII—6 we shall see how 
the decay 2° — yy is attributed to the axial anomaly. 

The presence of anomalies in gauge theories is far more serious because they 
destroy the gauge invariance of the theory and wreak havoc with renormalizability. 
Thus, one attempts to employ only those gauge theories which have no anomalies. 
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In some cases this can be arranged by ensuring, through the group or particle con- 
tent of the theory, that the coefficient D’“ of Eq. (3.60b) vanishes. For example, 
in the Standard Model it must be checked that this occurs for all combinations of 
the SU(3). x SU(2);, x U(1)y generators. These were already compiled in Eqs. 
(II—3.5a—c) and were seen to lead to the quantized fermion charge values observed 
in Nature. 
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If the fermion masses were zero in either QED or QCD, these theories would con- 
tain no dimensional parameters in the lagrangian, and they would exhibit a classical 
scale invariance. The associated quark and gluon scale transformations would be 
W(x) > Py (Ax) and A4 (x) > 4A%(A.x) for arbitrary 4. We saw in Sect. I-4 
that this leads to a traceless energy-momentum tensor, with conserved dilation 


H 
current Joie. 


H 
Joale 


Saye: JJ n = 0" = 0, (4.1) 


KL“ scale 


where 6“” is the energy-momentum tensor. Such a situation would have drastic 
consequences on the theory, since all single particle states would be massless. 
This can be seen as follows. For any hadron H, the matrix element of the energy- 
momentum tensor at zero-momentum transfer is 


(H(k) |0""| H(k)) = 2k"“k”, (4.2) 


where the normalization of states is chosen in accordance with the conventions 
defined in App. C-3. A vanishing trace would imply zero mass, i.e., 


(H(k) |6% | H(k)) = 0 = 2M}. (4.3) 


This is most obviously a problem in QCD where the quark masses are small com- 
pared to most composite particle masses.* We would not expect the proton mass to 
vanish if the quark masses were set equal to zero yet the scale-invariance argument 
implies that it must. 

A resolution is suggested by the method which is used to renormalize the 
theory. In practice, renormalization prescriptions introduce dimensional scales into 
the theory. Most commonly, there is the momentum scale at which one specifies 
the running coupling constant to have a particular value, e.g., a;(91 GeV) ~ 0.12. 
This in turn defines a scale A which enters the formula for the running coupling 
constant, Eq. (II-2.74). Thus, to fully specify QCD one needs to specify not only 
the lagrangian, but also a scale parameter, and the full quantum theory is not scale 


4 As can be justified, we neglect here the existence of very heavy quarks, c, b, and t. 
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invariant. Although this argument does not, at first sight, seem to nullify the reason- 
ing based on Noether’s theorem, it turns out that the trace of the energy-momentum 
tensor has an anomaly [Cr 72, ChE 72, CoDJ 77], and the specification of a scale 
and the coefficient of the anomaly are in fact related. 

In the following, let us start directly with the path-integral treatment [Fu 81], 
again in the framework of QCD, concentrating on the effect of a single quark. We 
can introduce an external source coupled to 0%, into the generating functional 


Zh, AS] = Ja dije S Hx lEocoth Ap +hee'y | (4.4) 
where 
guy = shy" By. (4.5) 


As in the case of the chiral anomaly, we can use this as a starting point to explore 
the nature of the trace 60%,. The key is that if one makes the change of variables 


yx) = ey (x), (4.6) 


one obtains for infinitesimal w 


l d*x [Loco (Y, A3 œ)) + a(x)" ] 
g g (4.7) 
= fas [Loco (W’, AS) tamy y + ip yuy aa]. 


The last term vanishes after an integration by parts. The focus of our calculation 
can thus be shifted to a jacobian J by a change of variable, 


Z [h +a, A‘ | = Ja dw ei T tx [Loco (W.A; )+h+0)0"h] 
= fa dy ei S tx [Coco (w. Ap) +ho'utamý' w'] (4.8) 
= Jaway J e S Px Loco (WA; )tho'utamb y] 


Thus, we obtain the identity 


if ax 6M a(x) =ing +i f a's myy a(x). (4.9) 
The form of the jacobian which follows from the work done in Sect. II[—3 is 
J = |det e = lim eT’ Six (|e exp -0/mM?|x) | (4.10) 
M->oo 


where we have adopted the same regulator as used previously. 
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The final result is easily obtained from the general heat-kernel calculation of 
App. B—1, again using the identities of Eqs. (B—1.17), (B—1.18). After some algebra 
this becomes 


jane: |exp —()/M)°*| x) 


iM* n AP (OO Des D] 
_ ot oÊ a pb 3 MN et 
=e E 32M Fu Pap + 12M7 

3i M4 ig? 
=I 4 183 pa pov (4.11) 


4n? 48m? M S 
Here we have found both a term which is a divergent constant, and one which 
involves two-gluon field strengths. The divergent constant corresponds to the infi- 
nite zero-point energy of the vacuum. This can be seen by noting that if the zero- 
point energy is defined by the vacuum matrix element 


(OIH &)10) = T = (0 |0 œ)| 0), (4.12) 
then Lorentz covariance requires a nonzero trace 
v Eo v Eo 
(0 16" (x)| 0) = + e = (0|6% (x)| 0) = a (4.13) 


Thus, a constant in the vacuum matrix element of the trace is just four times the 
zero-point energy density. It is standard practice to subtract off this zero-point 
energy, and we shall do so by dropping the constant term. This is similar to the 
procedure of normal ordering the energy-momentum tensor. 

If we now combine these results using Eq. (4.9), we obtain 


. 4 er 4 auv 
ifa x OM a(x) =i fa š E: a pau +miylew, (4.14) 
which is equivalent to the eee relation 
= c= F + muy. (4.15) 


One may also derive the trace anomaly via the calculation of Feynman dia- 
grams, the triangle diagrams of Fig. II-1, but with the axial current replaced 
by the energy-momentum tensor. The trace anomaly is different from the chiral 
anomaly in that it receives contributions also from gluons. In the Feynman diagram 
approach, this arises from the replacement of quark lines by gluons, while in the 
path-integral context it occurs when one considers scale transformations of the 
gluon field. A full calculation yields 


gu _ Bocp 
ho 223 


Fiw F°™ + m,ūu + madd + m;5s +- (4.16) 
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where Bgcp is the beta function of QCD (cf. Eq. (II-2.57b)). The result of our 
previous calculation, Eq. (4.15), corresponds to the lowest order contribution of a 
single quark to the beta function. 

A feeling of why the beta function enters can be obtained from an extremely 
simple, but heuristic, derivation of the trace anomaly. Let us rescale the gluon field 
to A; = 23 A> such that the massless action becomes 


1 —— 
L=- sa a iy: (4.17) 
The coupling constant g3 now enters only as an overall factor in the first term. 
However in renormalizing the coupling constant, we need to introduce a renormal- 
ization scale. If we interpret this coupling as a running parameter, the action is no 
longer invariant under scale transformations. Instead, taking à = 1 + 6A, we find 


5S j | 
2 / ee Fe paw = I dix PODES) pa paw (4.18) 
5A aa 4ga) " Zg; " 


where we have changed back to the standard normalization of A/, in the final term. 
By Noether’s theorem, the scale current is no longer conserved, and Eq. (4.16) 
is reproduced. The need to specify a scale in defining the coupling constant has 
removed the scale invariance of the theory. 

The trace anomaly occupies a significant place in the phenomenology of hadrons 
because it is the signal for the generation of hadronic masses. Returning to the 
discussion of masses which began this section, we see that the mass of a state is 
expressible as a matrix element of the energy-momentum trace. For example, we 


find for the nucleon state that 
myu(p)u(p) = (N(p)|6",|N (p)) 


= (Np ee F, Fl + m,5s + myiiu + madd|N(p)). (4.19) 


The terms containing the light quark masses m,, mg are expected to be small, and 
indeed the ‘o-term’ determined in x N scattering (cf. Sect. IU—3) implies that they 
contribute about only 45 MeV. This leaves the bulk of the nucleon’s mass to the 
gluon and s-quark terms in Eq. (4.19), of which the F/,, F““” part is expected to be 
dominant. Although this presents a conceptual problem for the naive quark model 
interpretation of the proton as a composite of three light quarks, it is nevertheless 
a central result of QCD. 


IWi-5 Chiral anomalies and vacuum structure 


There is a fascinating connection between the axial anomaly described previously 
in this chapter and the vacuum of QCD. This has important phenomenological 
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consequences for both the n’ mass and the strong CP problem. Here we present an 
introductory account of this topic [Pe 89]. 


The 0 vacuum 


One is used to considering the effect on gluon fields of ‘small’ gauge transforma- 
tions, i.e., those which are connected to the identity operator in a continuous man- 
ner. There also exist ‘large’ gauge transformations which change the color gauge 
fields in a more drastic fashion. For example the gauge transformation [JaR 76] 
generated by 


xX -d 2idt -X 
MO) =a erd 


(5.1) 


where d is an arbitrary parameter and t is an SU(2) Pauli matrix in any SU(2) 
subgroup of SU (3), transforms the null potential A(x) = 0 into 


A? (a) = -1 (Vj Ai(x)) Ap! (x) 
= e [t;a — x?) + 2x; (T - x) — 2d(x x t);| ? (5.2) 
83 (X 


Here, we are using the matrix notation 
— a 27 
A, =A Ta (5.3) 
This potential lies in an SU (2) subgroup of the full color SU (3) group, and is 
‘large’ in the sense that it cannot be brought continuously into the identity. The 
T -x factor couples the internal color indices to the spatial position such that a path 
in coordinate space implies a corresponding path in the SU(2) color subspace. 
All gauge potentials A,, carry a conserved topological charge called the winding 
number, 
i03 
= 8 
247? 


J dx Tr (Aj(x)Aj(x)Ag(x)) 7. (5.4) 


As can be demonstrated by direct substitution, the gauge field of Eq. (5.2) corre- 
sponds to the value n = 1. Fields with any integer value of the winding number n 
can be obtained by repeated applications of A; (x), viz., 


An(x) =A". (5.5) 


All gauge potentials can be classified into disjoint sectors labeled by their winding 
number. 


100 Symmetries and anomalies 


The existence of these distinct classes has interesting consequences. For exam- 
ple, consider a configuration of the gluon field that starts off at t = —oo as the zero 
potential A(x) = 0, has some interpolating A(x, t) for intermediate times, and ends 
up at t = +00 lying in the gauge-equivalent configuration A(x) = A‘? (x).° Then 
the following integral can be shown to be nonvanishing: 


83 
3272 


ger Ft). (5.6) 


a x 1 
4 a auv apv — 


This is surprising because the integrand is a total divergence. As noted previously 
in Eq. (II-2.23), F F can be written as 


: 1 
Fe E =a K", K" = [AS FE, + 383 fabe AVA; AS |, (5.7) 


and thus the integral can be written as a surface integral at t = -too. For the field 
configuration under consideration, this reduces to the winding-number integral 


83 4 = 83 4 
sane fx Fie = gga f raak" 


t=—00 


=1. (5.8) 


More generally, the integral of F F gives the change in the winding number 


83 = 83 
3 fats Ee pow = 3 [as K® 
327? j 327? 


t=00 


=n} —n_ (5.9) 


t=—00 


between asymptotic gauge-field configurations. 

Thus, the vacuum state vector will be characterized by configurations of gluon 
fields, which fall into classes labeled by the winding number. Moreover, there will 
exist a corespondence between the gauge transformations {A,,} and unitary oper- 
ators {U,,}, which transform the state vectors. For example, a vacuum state domi- 
nated by field configurations in the zero winding class (‘near’ to A, = 0) would be 
transformed by U; into configurations with a dominance of n = 1 configurations, 
or more generally, 


U,|n) =|n+1). (5.10) 


5 Such configurations are known to exist [Co 85]. 
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This implies that a gauge-invariant vacuum state requires contributions from all 
classes, such as the coherent superposition 


= > e iF in), (5.11) 


where @ is an arbitrary parameter. It follows from Eq. (5.10) that this @-vacuum is 
gauge-invariant up to an overall phase 


U,|0) = e!7|@). (5.12) 


The QCD vacuum must contain contributions from all topological classes. 


The 0 term 
Given this nontrivial vacuum structure, one requires three ingredients to completely 
specify QCD: (1) the QCD lagrangian, (2) the coupling constant (i.e. Agcp), and 
(3) the vacuum label 6. How can we account for the different vacua corresponding 
to different choices of 0? In a path-integral representation, the 0 = 0 vacuum would 
imply generic transition elements of the form 


out (9 = 0|X19 = 0) in = J [dA idy] [dy] Xei = $ ou(m|X|n)in- 


n,m 


(5.13) 


The presence of a nonzero 0 leads to an extra phase, 

wO a = Ye" A |) in- (5.14) 
However, this phase can be accounted for in the path integral by the addition of a 
new term to Sgcp. In particular we have, through the use of Eq. (5.9), 


38 fdtx Fe, Fow 


out (91 X1)in = / [dA dvd] X eet x2 
=) ee our (rm |X In), 


n,m 


(5.15) 


where X is some operator. We see that the quantity (m — n) given by the winding- 
number difference of the fields contributing to the path integral is equivalent to a 
new exponential factor containing Fg, Fow, Thus, a correct procedure for doing 
calculations involving 6 vacua is to follow the ordinary path-integral methods but 
with a QCD lagrangian containing the new term 


Lon = £85) +68 83 pa pam, (5.16) 


aoa A 
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The parameter 0 is to be considered a coupling constant. Since the operator F F is 
P-odd and T-odd, a nonzero 0 can induce measurable T violation. In Sect. IX—4, 
we shall show how to connect 0 to physical observables. There is an important dis- 
tinction between the various 0 vacua of QCD and the many possible vacuum states 
of a spontaneously broken symmetry such as the Higgs sector of the electroweak 
theory. In the latter case, the various possible vacuum expectation values of the 
Higgs field label different states within the same theory. In contrast, each value of 
0 corresponds to a different theory, just as each value of Agcp would label a dif- 
ferent theory. Specifying 6 and Agcp then specifies the content of the version of 
QCD used by Nature. 


Connection with chiral rotations 


There is a connection between the axial anomaly and the presence of a 0 vacuum 
[tH 76a,b]. It involves the matrix element of F F as follows. Consider the limit of 
Ny massless quarks. The U (1) axial current 


Nf 
0 = 
i = So him (5.17) 
j=l 
is not conserved due to the anomaly, 
N fats 


eS ea pee (5.18) 


However, because of the fact that F F is a total divergence, one can define a new 
conserved current 


Nea 
(0) fUs 
Jsu = Is — ie Rus (5.19) 
While Js, does form a conserved charge, 
Qs = J Px Js.o(X), (5.20) 


neither Qs nor Jsu is gauge-invariant. In fact, under the gauge transformation A, 
of Eq. (5.1), it follows from Eq. (5.8) that the operator Q5 changes by a c-number 
integer 

Ui Õ5;U7' = Os —2Ny. (5.21) 
This tells us that in the world of massless quarks, the different 6-vacua are related 
by a chiral U (1) transformation, 


U1 e'25|0) = Ues UT! Uy |) = ef O28) e425 19), (5.22) 
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or, from Eq. (5.12), 


ei" 9519) = |0 —2N a), (5.23) 


where œ is a constant. Therefore, in the limit of massless quarks, when Os isa 
conserved quantity, all of the @ vacua are equivalent and one can transform away the 
0 dependence by a chiral U (1) transformation. The same is not true if quarks have 
mass, as the mass terms in £gcp are not invariant under a chiral transformation. 
We shall return to this topic in Sect. IX-4. 

To summarize, one finds that the existence of topologically nontrivial gauge 
transformations, and of field configurations which make transitions between the 
different topological sectors of the theory, leads to the existence of nonvanishing 
effects from a new term in the QCD action. Chiral rotations can change the value 
of 0, allowing it to be rotated away if any of the quarks are massless. However, 
for massive quarks, the net effect is a measurable CP-violating term in the QCD 
lagrangian. 


III-6 Baryon- and lepton-number violation in the Standard Model 


An even more dramatic effect arises from an anomaly in the current for the total 
baryon plus lepton number (B + L). Baryon number appears to be a conserved 
quantity when Noether’s theorem is applied to the lagrangian of the Standard 
Model, as is total lepton number.° The invariances are 


q —> eg, L— eg (6.1) 


for all quarks q and leptons £. The corresponding currents involve the sum over all 
quarks and leptons 


1 - 
J“ = ~(ay" dvtid+.-- 
$ 3 ily u+dy"d+---) (6.2) 


Ji = ey"e + DeL Y” VeL Se aes 


where the normalization of the baryon current is chosen to give a baryon a charge 
of +1. 

The baryon current is vectorial, and naively might not be expected to have an 
anomaly. However, the coupling of the quarks to the SU(2); and U(1)y gauge 
bosons violates parity, so that there are VVA triangle diagrams involving the 
baryon current with two gauge currents. For example, the triangle diagram 


6 Tf there are neutrino Majorana masses, lepton number will be violated. However, this is independent of the 
anomaly effect discussed in this section. Majorana masses will be discussed in Chap. VI. 
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involving the baryon current with the U(1)y hypercharge current has a VV A tri- 
angle involving the quantum number sum 


Tr(B(¥r + Yr) (YL — Yr)) = —2 (6.3) 


where B = 1/3 for quarks and B = 0 for leptons. These diagrams then yield an 
anomaly. Because the axial current of this triangle is a gauge current, any gauge- 
invariant regularization of the triangle diagram will place the anomaly in global 
baryon-number current even though it is vectorial (see the discussion surrounding 
Eq. (3.22)). Similar anomalies occur in the lepton number current.’ The anomalies 
cancel if we take the difference of the baryon and lepton currents, with the resulting 
anomaly equations 


ðL (JË — JH) =0 
(6.4) 


pIE + IE) = os (eh Fi Ft” — gB B). 


3272 
Here we see that, because of the anomaly, baryon number is in fact not conserved 
in the Standard Model, although B — L is. 

However, the baryon-number violation due to the anomaly is unmeasurably small 
at low temperature. Any transition that would change baryon number is non- 
perturbative in nature, as it is not seen in the usual perturbative Feynman rules. 
In weakly coupled field theory, such nonperturbative phenomena are suppressed in 
rate by a factor [’tH 76b] 


[e787/83]2 ey 107160, (6.5) 


so that such transitions are unobservable. 

At high temperatures the situation is different [KuRS 85]. The classical solu- 
tion mediating a transition which changes baryon number, a sphaleron [KIM 84], 
is known in the limit 6,, — O and the corrections due to a nonzero 6, can be 
estimated. The solution has an energy around Espn ~ 10 TeV, taking into account 
the measured Higgs-boson mass. At high temperature, thermal effects can cause 
transitions with a Boltzmann factor e~*%'/", and at very high temperatures all 
suppressions disappear and the rate per unit volume scales with the temperature 
T/V ~T’. 

This has an important consequence — at equilibrium in the early Universe an 
initial excess of baryons can disappear. More precisely, the equilibrium value of 
B + L is zero at high temperature. However, B — L is still conserved, so that an 
initial excess of B — L will be preserved. 


7 Because possible right-handed neutrinos have no gauge couplings, their presence would not modify the 
anomaly. 


Problems 105 


It is natural to ask if a sufficiently large baryon asymmetry in the Universe can 
be generated by out-of-equilibrium processes near the electroweak phase transi- 
tion, using only Standard Model interactions. The answer appears to be negative 
[GaHOP 94], as the necessary CP violation within the Standard Model is too small 
and the phase transition is not strong enough. New interactions near the weak scale 
could provide the needed extra physics. Alternatively, the residual baryon asymme- 
try may arise from a net B — L generated in the Universe before the electroweak 
epoch. Within the context of the Standard Model interactions, the simplest such 
possibility is leptogenesis involving heavy right-handed neutrinos with Majorana 
masses. This mechanism will be discussed in Sect. VI-6. 


Problems 


(1) Currents and anomalies 
(a) Verify that all currents coupled to gauge bosons in the Standard Model are 
anomaly free. 
(b) Find the relative strength of the anomaly coupling of the baryon number 
current to the SU(2); and U(1)r gauge bosons. 
(2) Trace anomaly in QED 
In d dimensions, the trace of the energy-momentum tensor does not vanish 
classically, except at d = 4. For example, in massless QED the energy- 
momentum tensor, 


1 j 
0” = — Fi FY + qo 8 Be + shy" Dy, 


has trace ok = TAF %0 F, o. In the renormalization of the operator F”? F,,,, 
one encounters a renormalization constant which diverges as d — 4. Use this 
feature to calculate the QED trace anomaly using dimensional regularization. 


IV 


Introduction to effective field theory 


The purpose of an effective field theory is to represent in a simple way the dynam- 
ical content of a theory in the low-energy limit. One uses only those light degrees 
of freedom that are active at low energy, and treats their interactions in a full 
field-theoretic framework. The effective field theory is often technically non- 
renormalizable, yet loop diagrams are included and renormalization of the physical 
parameters is readily accomplished. 

Effective field theory is used in all aspects of the Standard Model and beyond, 
from QED to superstrings. Perhaps the best setting for learning about the topic is 
that of chiral symmetry. Besides being historically important in the development of 
effective field theory techniques, chiral symmetry is a rather subtle subject, which 
can be used to illustrate all aspects of the method, viz., the low-energy expansion, 
non-leading behavior, loops, renormalization and symmetry breaking. In addition, 
the results can be tested directly by experiment since the chiral effective field theory 
provides a framework for understanding the very low-energy limit of QCD. 

In this chapter we introduce effective field theory by a study of the linear sigma 
model, and discuss the generalization of these techniques to other settings. 


IV-1 Effective lagrangians and the sigma model 


The linear sigma model, introduced in Sects. I-4, I-6, provides a ‘user friendly’ 
introduction to effective field theory because all the relevant manipulations can be 
explicitly demonstrated. The Goldstone boson fields, the pions, are present at all 
stages of the calculation. It also introduces many concepts which are relevant for 
the low enegy limit of QCD. However, low-energy QCD is far less transparent, 
involving a transference from the quark and gluon degrees of freedom of the origi- 
nal lagrangian to the pions of the physical spectrum. Nevertheless, the low-energy 
properties of the two theories have many similarities. 
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The first topic that we need to describe is that of an “effective lagrangian’. First, 
let us illustrate this concept by simply quoting the result to be derived below. Recall 
the sigma model of Eq. (4.14), 


- 1 1 
L= hipy + 5am - dx + zuo do 
g 1 i 
— giv (0 -it ayy tze (o> +m?) -F (or +H)’. (1.1) 


This is a renormalizable field theory of pions, and from it one can calculate any 
desired pion amplitude. Alternatively, if one works at low-energy (E < u), then 
it turns out that all matrix elements of pions are contained in the rather different 
looking ‘effective lagrangian’ 


F? 
Leg = Tr (a,Ua"U'), U =expit-x/F, (1.2) 


where F = v = y u?/à at tree level (cf. Eq. (I-6.9)). This effective lagrangian is 
to be used by expanding in powers of the pion field 
1 

6F2 
and taking tree-level matrix elements. This procedure is a relatively simple way of 
encoding all the low-energy predictions of the theory. Moreover, with this effective 
lagrangian is the starting point of a full effective field theory treatment including 
loops, which we will develop in Sect. IV-3. 


L= sour -Əm + (x -dyry — x? (pre + arm) | pesg (3) 


Representations of the sigma model 


In order to embark on the path to the effective field theory approach, let us rewrite 
the sigma model lagrangian as 


1 u À 
L= 3 Tr (EdE) + Z Tr (2'E) - pusat 
+ Writ + Prip Yr — g (YLEYr + PRE YL), (1.4) 


with & = ø + it - a. The model is invariant under the SU (2); x SU (2)r trans- 
formations 


Yr > Ly, Wr —> Rwre, SoS LER (1.5) 


for L, R in SU (2). This is the linear representation. ! 


1 A number of distinct 2 x 2 matrix notations, among them £, U, and M, are commonly employed in the 
literature for either the linear or the nonlinear cases. It is always best to check the definition being employed 
and to learn to be flexible. 
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After symmetry breaking and the redefinition of the o field, 


o=v+0, v=,/—, (1.6) 
the lagrangian reads” 
1 1 
L= (3 õ3"ő — 2u’õ?) + 59% O'm — vs (6? +x?) 


À 7 _ 
er es n?) +y (ip — gv) y — gh (6 —it- mys) y, (1.7) 


indicating massless pions and a nucleon of mass gv. All the interactions in the 
model are simple nonderivative polynomial couplings. 

There are other ways to display the content of the sigma model besides the above 
linear representation. For example, instead of o and m one could define fields S 
and 9, 


vIt 


= 
Vf ( +v) + 2? 


where one expands in inverse powers of v. For lack of a better name, we can call 
this the square-root representation. The lagrangian can be rewritten in terms of the 
variables S and g as 


k= ; | (a5) 7 248°] us ; ( =) ae! =F us ael] 


ET =at-, 


(1.8) 


v2 — g? 
v+S 
v 


-aws — 54+ Fiv -e ( ) [e-o -ioen y. 


(1.9) 
Although this looks a bit forbidding, no longer having simple polynomial interac- 
tions, it is nothing more than a renaming of the fields. This form has several inter- 
esting features. The pion-like fields, still massless, no longer occur in the potential 
part of the lagrangian, but instead appear with derivative interactions. For vanishing 
S, this is called the nonlinear sigma model. 


Another nonlinear form, the exponential parameterization, will prove to be of 
importance to us. Here the fields are written as 


E =0 +it- m = (v+ S)U, U = exp (it - x'/v) (1.10) 
such that x’ = m +---. Using this form, we find 


2 Here, and in subsequent expressions for £, we drop all additive constant terms. 
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Fig. IV-1 Contributions to m"x? elastic scattering. 


2 
f= ; [0s = 278? | + on Tr (d,Ua"U") 


À = = = ; 
— AvS? — 7S" + wip -gu + sS) (WLU Wr + WRU). (1.11) 
The quantity U transforms under SU (2); x SU (2) in the same way as does &, i.e., 
U > LU Rİ. (1.12) 


This lagrangian is reasonably compact and also has only derivative couplings for 
pions. 


Representation independence 


We have introduced three sets of interactions with very different appearances. They 
are all nonlinearly related. In each of these forms the free-particle sector, found by 
looking at terms bilinear in the field variables, has the same masses and normal- 
izations. To compare their dynamical content, let us calculate the scattering of the 
Goldstone bosons of the theory, specifically 7*2° —> m+z°. The diagrams that 
enter at tree level are displayed in Fig. [V—1. The relevant terms in the lagrangians 
and their tree-level scattering amplitudes are as follows. 


(1) Linear form: 


Xr 
Lr= z (x?) — Aven’, 
iMgrtn0srtr0 = —2ià + (—2iAv)? = 
q“ — ms 
207 iq? 
= —2id} 1 = e, 1.13 
i | + v? = ee 


where q = p', — p+ = po — po and the relation m2 = 2Av? = 2u? has been 
used. The contributions of Figs. IV-1(a), 1(b) are seen to cancel at q? = 0. 
Thus, to leading order, the amplitude is momentum-dependent even though 
the interaction contains no derivatives. The vanishing of the amplitudes at zero 
momentum is universal in the limit of exact chiral symmetry. 
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(2) Square-root representation: 


2 
Fics 1 (9 : du) 
"2 (= 9) 
For this case, the contribution of Fig. [V—1(b) involves four factors of momen- 
tum, two at each vertex, and so may be dropped at low-energy. For Fig. [V—1(a) 


S 
+—d,9-0"9. (1.14) 
v 


we find 
Li spn (PP up + ue +9 ae) , 
TENTE un p) _ ie ie (1.15) 
(3) Exponential representation: 
Lı = Ga (Ə U3 UŻ) +. (1.16) 


Again Fig. IV-1(b) has a higher-order (O(p*)) contribution, leaving only 
Fig. IV-1(a), 


Ly [e dye’) — x? (2° - an’) | 


~ 6v2 
i (P, — psy 


- ng (1.17) 


IM rerin Eno = 
v 


The lesson to be learned is that all three representations give the same answer 
despite very different forms and even different Feynman diagrams. A similar con- 
clusion would follow for any other observable that one might wish to calculate. 
The above analysis demonstrates a powerful field-theoretic theorem, proved first 
by R. Haag [Ha 58, CoWZ 69, CaCWZ 69], on representation independence. It 
states that if two fields are related nonlinearly, e.g., p = x F(x) with F(O) = 1, 
then the same experimental observables result if one calculates with the field ọ 
using £ (gy) or instead with x using £ (x F(x)). The proof consists basically of 
demonstrating that (i) two S-matrices are equivalent if they have the same single 
particle singularities, and (ii) since F(O) = 1, g and x have the same free field 
behavior and single-particle singularities. This result can be made plausible if we 
think of the scattering in non-mathematical terms. If the free particles are isolated, 
they have the same mass and charge and experiment cannot tell the ọ particle from 
the x particle. At this level they are in fact the same particles, due to F(O) = 1. 
The scattering experiment is then performed by colliding the particles. The results 
cannot depend on whether a theorist has chosen to calculate the amplitude using 
the ọ or the x names. That is, the physics cannot depend on a labeling convention. 
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This result is quite useful as it lets us employ nonlinear representations in situ- 
ations where they can simplify the calculation. The linear sigma model is a good 
example. We have seen that the amplitudes of this theory are momentum-dependent. 
Such behavior is obtained naturally when one uses the nonlinear representations, 
whereas for the linear representation more complicated calculations involving 
assorted cancelations of constant terms are required to produce the correct momen- 
tum dependence. In addition, the nonlinear representations allow one to display the 
low-energy results of the theory without explicitly including the massive o (or S) 
and y fields. 


IV—2 Integrating out heavy fields 


When one is studying physics at some energy scale E, one must explicitly take into 
account all the particles which can be produced at that energy. What is the effect of 
fields whose quanta are too heavy to be directly produced? They may still be felt 
through virtual effects. When using an effective low-energy theory, one does not 
include the heavy fields in the lagrangian, but their virtual effects are represented 
by various couplings between light fields. The process of removing heavy fields 
from the lagrangian is called integrating out the fields. Here, we shall explore this 
process. 


The decoupling theorem 


There is a general result in field theory, called the decoupling theorem, which 
describes how the heavy particles must enter into the low-energy theory [ApC 75, 
OvS 80]. The theorem states that if the remaining low-energy theory is renormal- 
izable, then all effects of the heavy-particle appear either as a renormalization 
of the coupling constants in the theory or else are suppressed by powers of the 
heavy-particle mass. We shall not display the formal proof. However, the result is 
in accord with physical expectations. If the heavy particle’s mass becomes infinite, 
one would indeed expect the influence of the particle to disappear. Any shift in 
the coupling constants is not directly observable because the values of these cou- 
plings are always determined from experiment. Inverse powers of heavy-particle 
mass arise from propagators involving virtual exchange of the heavy particle. 

In the Standard Model, the most obvious example of this is the role played in 
low-energy physics by the W~ and Z gauge bosons. For example, while W~-loops 
can contribute to the renormalization of the electric charge, the effect cannot be 
isolated at low energies. Also, the residual form of W~-exchange amplitudes is that 
of a local product of two weak currents (Fermi interaction) with coupling strength 
Gr. Its effect is suppressed because Gr x M oe 
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However, in the Standard Model there is an example where the heavy-particle 
effects do not decouple. For a heavy top quark, there are many loop diagrams which 
do not vanish as m, — ov, but instead behave as m? or In(m?). This can occur 
because the electroweak theory with the ¢ quark removed violates the SU (2), 
symmetry, as the full (5) doublet is no longer present. Without the constraint of 
weak-isospin symmetry, the theory is not renormalizable and new divergences can 
occur in flavor-changing processes. These would-be divergences are cut off in the 
real theory by the mass m,. Note that at the same time as m, — od, the top quark 
Yukawa coupling also goes to infinity, and hence induces strong coupling, which 
can also lead to a violation of decoupling. 

In the sigma model, all the low-energy couplings of the pions are proportional 
to powers of 1/v? œ 1/m2, the simplest example being Eq. (1.9). Hence the effec- 
tive renormalizable theory is in fact a free field theory, without interactions. The 
interactions have been suppressed by powers of heavy-particle masses. We shall 
use the energy expansion of the next section to organize the expansion in powers 
of the inverse heavy mass. 


Integrating out heavy fields at tree level 


The name of this procedure comes from the path-integral formalism, where the 
process of integrating out a heavy field H and leaving behind light fields £; is 
defined in terms of an effective action Wer[2;], 


Ze] = e! Werle] = J [dH] ei Í Px LHO) (2.1) 


However, the procedure is equally familiar from perturbation theory, in which the 
effect of the path integral is represented by a sum of Feynman diagrams. 

Let us proceed with a path-integral example. Consider a linear coupling of H to 
some combination of fields J, with the lagrangian 


1 
L=3 (ðL Ha" H — m4 H’) + JH. (2.2) 
One way to integrate out H is to ‘complete the square’, i.e., we write 


[as L(A, J) = [as |-;HDH + JH 


= -5 f a's [((H-D"J)D(H-D"J)-JD"'J] 


= -5 fats [H'DH' — JDJ], (2.3) 


where we have used the shorthand notations, 
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D=0+m}, 
D!J = - f ay Ar(x — y) JY), 
(O, + my) Ara — y) = 8t — y), 


H'(x) = H(x) + fay Ar(@ — y)JQ), 
[as JDJ = - fats d*y J(x)Ar(x —y)J(y), 24 


and have integrated by parts repeatedly. Since we integrate in the path integral 
over all values of the field at each point of spacetime, we may change variables 
[dH] = |[dH’]so 


Z[J] = ei Werl] = fimestsean 
E jie ee 
= Z[0] etfs ID (2.5) 
where 
Z[0] = J o a (2.6) 


Here, Z[0] is an overall constant that can be dropped from further consideration. 
From this result we obtain the effective action 


1 
Wenl J] = —5 J d*x d*y J (x) Ar(x — y) J Y). (2.7a) 


This action is nonlocal because it includes an integral over the propagator. How- 
ever, the heavy-particle propagator is peaked at small distances, of order 1/m%4. 
This allows us to obtain a local lagrangian by Taylor expanding J(y) as 


Jy) =I + (y— 3)" [SO], to (2.7b) 


Keeping the leading term and using 


1 
Ja Ar& -= y) =--> (2.8) 
H 


we obtain 


Weel J] = J d’x 


1 
m I(x) I) +, (2.9) 
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where the ellipses denote terms suppressed by additional powers of my. Outside 
of the path-integral context, this result is familiar from W-exchange in the weak 
interactions. 


Matching the sigma model at tree level 


We can apply this procedure to the lagrangian for the sigma model, where the scalar 
field S is heavy with respect to the Goldstone bosons. Thus, considering the theory 
in the low-energy limit, we may integrate out the field S. Referring to Eq. (1.11) 
and neglecting the $? interactions, it is clear that we should make the identifications 
H —> Sand J > vTr(d,Ud“U")/2. The effective lagrangian then takes the 
form 


9 2 
Lot = — Tr (8,00"U) + —, [Tr (,Ua"UT)] +--+, (2.10) 
4 8m5 


where the second term in Eq. (2.10) is the result of integrating out the S-field and 
gives rise to the diagram of Fig. [V—1(b). Additional tree-level diagrams are implied 
by the sigma model when one includes the $° and S* interactions. Since these carry 
more derivatives, the above result is the correct tree-level answer with up to four 
derivatives. 

This calculation is an illustration of the concept of ‘matching’, here applied at 
tree level. We match the effective field theory to the full theory in order to reproduce 
the correct matrix elements. From the starting point of Eq. (1.11), we expect that 
there will be a low-energy effective lagrangian, which is written as an expansion 
in powers of Tr (0,,Ud“U"), with coefficients that are initially unknown. In the 
matching procedure, we choose the coefficients to be those appropriate for the full 
theory. 

In calculating transitions of pions, this is then used by expanding the U matrix 
in terms of the pion fields and taking matrix elements. At the lowest energies, 
only the lagrangian with two derivatives is required, justifying the result quoted 
in Eq. (1.2). Interested readers may verify that the two terms in Eq. (2.10) repro- 
duce the first two terms in the z+? scattering amplitude previously obtained in 
Eq. (1.13). However, we have gained a great deal by using the effective lagrangian 
framework, because now all matrix elements of pions can be calculated simply to 
this order in the energy by simply expanding the effective lagrangian and reading 
off the answer. 


3 We will show that this term is not modified by loop effects, aside from the renomalization of the parameter v. 
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IV—3 Loops and renormalization 


The treatment above has left us with a nonlinear effective lagrangian of the form 
that is called ‘non-renormalizable’. It is also incomplete because loop diagrams 
have not yet been considered. One might worry that because the effective lagrangian 
is non-renormalizeable, loops would cause trouble. However, that is not the case. 
Indeed, this situation helps demonstrate the ‘effectiveness’ of effective field 
theory — we will see that the important loop processes are reproduced in a sim- 
pler manner using the effective field theory. 

Continuing our treatment of the linear sigma model, let us display the precise 
formal correspondence between the full theory and the effective theory. If we are 
only considering matrix elements involving the light pions, we can write the path 
integral defining the theory‘ as 


Z{j] = vf [dx (x)] [do (x)] exp f J d*x (L[m(x), o(x)] +j(x) - mx)| ; 
(3.1) 


When working at low energies, we can then integrate out the heavy field o to 
produce the effective theory 


Z{j] = vf [dm (x)] exp | [as (Leet [a (x)] + j(x) - mx) ; (3:2) 


Because the o field is heavy, its influence will not propagate far and the resulting 
effective lagrangian will be local. However, this correspondence emphasizes the 
fact that one is still left with a full field theory. It is not only at tree level that the 
effective lagrangian must be applied. Loop processes must also be considered, as 
is the case in any field theory. The original theory involves both o and z loops, 
while the effective theory has only the z loop diagrams. We will demonstrate how 
to match the effective theory to the full theory through an explicit calculation. 

In order to accomplish the renormalization and matching procedure for the effec- 
tive theory we will need a lagrangian similar to the tree level form, but with initially 
unknown coefficients that will be chosen later, i.e., 


2 
Le = Te (0U 3U) 
+ €\[Tr (0,Ud"U")P + Tr (4,Ud,U") Tr (a“UA"U"). (3.3) 


This is the most general form consistent with the symmetry U —> LU RÌ, contain- 
ing up to four derivatives. The first portion of this lagrangian, when expanded in 
terms of the pion field, yields the usual pion propagator as well as the lowest-order 
result for the z7 scattering amplitudes. 


4 Recall that o = S in some previous formulas. 
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Fig. IV-2 A subset of one-loop diagrams contributing to 7 +z° elastic scattering. 


Let us again consider the process xt + 2° —> 2+ + 7°, this time to one loop. 
The full linear sigma model is renormalizeable and will yield finite predictions 
in terms of the (renormalized) parameters of the theory. The effective theory has 
been constructed to have the same vertices at the lowest energies, but will have 
quite different high-energy properties because it is missing the extra high-energy 
degree of freedom. There will be new divergences present in perturbation theory. 
However, the low-energy effects will be similar in both calculations. 

For example, consider the set of diagrams depicted in Fig. [V—2. In the full 
theory, all of these diagrams exist, and our previous result of Eq. (1.13) can be 
used to write the combined amplitudes as 


iM J A 2ià + (—2i àv)? : 
iMen = | —— | -21 2v) ————_,_ | —--—— = 
ii (2r) (k+ p+)? — m2 | (k+ p+ po? K 


EEEE E 
| 2ià + (—2iAv) ea: (3.4) 


The result is a sum of bubble, triangle, and box diagrams. The box in particular 
is a very complicated function of the kinematic invariants, involving di-logarithms 
[tHV 79, DeNS 91, EIZ 08]. The divergence from the bubble diagram goes into 
the renormalization of the 4 coupling of the original lagrangian. For the effective 
theory, in contrast, one uses only pions and considers only the bubble diagram. 
The low-energy limit of the vertex is employed. Again, drawing from our results 
of Eq. (1.13), also visible by taking the leading approximation for the vertices in 
Eq. (3.4), one finds 


=f dtk i(k + ps) i i i(k+ ply (3.5) 
~ J Qm) v? (k + p+ + po)? k? v? l l 


This diagram has a different divergence than the full theory. Itis also much simpler 
kinematically, and its dimensional regularized form is easily evaluated as 
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. i 2 —s — i€ 
iMee = o6n2 — u) E — y +ln4x — 1n E | 


l 
—— [P= : 
+ 288724. s^ — 5su], (3.6) 


using the usual variables s = (p+ + po)”, t = (p+ — p’,)*, u = (po — py. 

There are various interesting features of this result. Note that the whole ampli- 
tude is of order (energy)*, while the original scattering vertex of Eq. (1.13) was 
of order (energy). Technically, this follows simply from noting that the loop has 
factors of 1/v‘* and that in dimensional regularization the only other dimensional 
factors are the external energies. On a more profound level it is an example of the 
energy expansion of the effective theory — loops produce results that are suppressed 
by higher powers of the momenta at low-energy. Because of this kinematic depen- 
dence, one can also readily see that the divergence cannot be absorbed into the 
renormalization of the original O(E”) effective lagrangian. In fact we know that 
this divergence is spurious. It was generated because the effective theory had the 
wrong high-energy behavior compared to the full theory. This is to be expected in 
an effective theory — it does not pretend to know the content of the theory at all 
energies. However, the divergence will disappear in the matching of the two theo- 
ries through the renormalization of a term in the O(E*) lagrangian — this will be 
demonstrated below. 

Even more interesting from the physics point of view is that the s(s — uw) In—s 
behavior is exactly what is found by taking the low-energy limit of the compli- 
cated result from the full theory and expanding it to this order in the momenta. 
This occurs because the In—s factor comes from the low-energy regions of the 
loop momenta, of order k ~ s, so that the logarithm represents long-distance prop- 
agation.° Indeed, the imaginary part of the amplitude arising from In(—s — ie) = 
In(s)—iz (for s > 0) comes from the on-shell intermediate state of two pions. This 
logarithm could never be represented by a local effective lagrangian and is a dis- 
tinctive feature of long-distance (low-energy) quantum loops. These features match 
in the two calculations because when the loop momenta are small the effective 
field theory approximation for the vertex is valid. Overall, the effective field 
theory has an incorrect high-energy behavior but does capture the correct low- 
energy dynamics. 

The comparison of the full theory and the effective theory can be carried out 
directly for this reaction. The dimensionally regularized result for the full theory 
is given in [MaM 08], but is too complicated to be reproduced here. However the 


5 Short distance pieces from higher values of k would be analytic functions able to be Taylor expanded around 
s=0. 
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expansion of the full theory at low-energy in terms of renormalized parameters is 
relatively simple [GaL 84] 


t 1 11 
Mean = mt | = Je 


mv? 96r?vt 


rre —w) + uw — 9] 


a? in el yl z 3.7 
P m2 sS io = G7) 


o o o 


1 
967? v4 
The effective theory result [Le 72, GaL 84] has a very similar form but does not 
know about the existence of the o, 


Mer = + | 86 +26 + : 2 
a 1 2" 19272 | v4 


f 4 
+ E + Ter aa | [s(s — u) + u(u — s)]/v (3.8) 
— = [s?in + s(s -Dh +u(u -9m2 | 
96724 u u? u? i 
where we have defined? 
© =h + 3 [zarta] 
384r? |4—d 
6 = h + — : Jagr tina]. (3.9) 
19272 |4—d 


At this stage we can match the two theories, providing identical scattering ampli- 
tudes to this order, through the choice 


z U 1 m? 35 
t= + In — 


8m2  192mr?| 2 6 
e E (3.10) 
— n = é è 
2 38472 ur 6 


The reader is invited to compare this result with the tree-level matching, Eq. (2.10). 
We have not only obtained a more precise matching, we also have generated impor- 
tant kinematic dependence, particularly the logarithms, in the scattering 
amplitude. 

We have seen that the predictions of the full theory can be reproduced even 
when using only the light degrees of freedom, as long as one chooses the coeffi- 
cient of the effective lagrangian appropriately. This holds for all observables. Once 


6 Readers who compare with [GaL 84] should be aware that our normalization of the £; coefficients differs by 
a factor of four. 
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the matching is done, other processes can be calculated using the effective theory 
without the need to match again for each process.’ The total effect of the heavy 
particle, both tree diagrams and loops, has been reduced to a few numbers in the 
lagrangian which we have deduced from matching conditions to a given order in 
an expansion in the energy. 

In this example we match to a known calculable theory. In other realizations of 
effective field theory, the full theory may be unknown (for example, in the case 
of gravity [Do 94]) or very difficult to calculate (as we will discuss for QCD). In 
cases where direct matching is not possible, the renormalized coefficients in the 
lagrangian could be determined through measurement. Measuring the value of the 
coefficients in one reaction would allow them to be used by the effective theory in 
other processes. 


IV-—4 General features of effective field theory 


After this explicit example, let us think more generally about effective field theo- 
ries. In quantum mechanics and quantum field theory, we face what appears to 
be an impossible situation. Intermediate states in perturbation theory and in loop 
diagrams include all energies, even beyond those which have been probed experi- 
mentally. Yet we expect more new particles and new interactions to be present 
eventually at higher energies. How can we then reliably perform any calculation 
without knowing the particles and interactions at all energies which enter in our 
calculations? 

The answer essentially comes from the uncertainty principle. Effects from high 
energy appear local when viewed at low energy. This means that they are equivalent 
to terms in a local lagrangian. Most often the coefficient of a particular term in a 
lagrangian — a mass or a coupling constant — is something that we have to measure. 
So the effects of physics from high energy is contained in the parameters that we 
measure at low energy. 

Effective field theory embraces this fact and uses it to perform calculations at 
low energy. In theories where the high-energy limit is known, such as our linear 
sigma model example above, the coefficients of the effective lagrangian can be 
determined by matching. In theories where the high-energy physics is not known, 
we still know that its effect is local, so that we parameterize it by the most general 
local lagrangian. 

The decoupling theorem tells us that the high-energy effect appears in renormal- 
ized couplings or in terms suppressed by powers of the heavy scale. In this sense, all 
of our theories can be viewed as effective field theories. The class of renormalizable 


7 As part of our treatment of QCD, we show the universality of the renormalization in Appendix B-2. 
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field theories is a subset of effective field theories in which the power-suppressed 
lagrangians have not yet been needed. 


Effective lagrangians and symmetries 


What would happen if, instead of having a straightforward known theory like the 
linear sigma model, we were dealing with an unknown or unsolvable theory with 
the same SU(2); x SU (2) chiral symmetry? In this case there would exist some 
set of pion interactions which, although not explicitly known, would be greatly 
restricted by the SU (2) chiral symmetry. Once again we could choose to describe 
the pion fields in terms of the exponential parameterization U, with a symmetry 
transformation 


U —> LUR' (4.1) 


for L, R in SU (2). Not having an explicit prescription, we would proceed to write 
out the most general effective lagrangian consistent with the chiral symmetry. In 
view of the infinite number of possible terms contained in such a description, this 
would appear to be a daunting process. However, the energy expansion allows it to 
be manageable. 

It is not difficult to generate candidate interactions which are invariant under 
chiral SU (2) transformations. For the purpose of illustration, we list the following 
two-derivative, four-derivative, and six-derivative terms in the exponential param- 
eterization, 


Tr (ð UƏtUŤ), Tr (3 U3 UŤ) - Tr (Ə U3”Uİ), 
Tr (,Ua"U") - Tr (8,UDa"U') . (4.2) 


There can be no derivative-free terms in a list such as this because Tr (U U +) = 
2 is a constant. It is clear that one can generate innumerable similar terms with 
arbitrary numbers of derivatives. The general lagrangian can be organized by the 
dimensionality of the operators, 


L = Li+ L4 t+ Lo + Ls +- 
2 


= z Tr (a,Ud"U") + &\[ Tr (3,Ua"U")]? 


+ €)Tr (ð U3 UŤ) - Tr (A“U9"U") +---. (4.3) 


The important point is that, at sufficiently low energies, the matrix elements of 
most of these terms are very small since each derivative becomes a factor of the 
momentum q when matrix elements are taken. It follows from dimensional analy- 
sis that the coefficient of an operator with n derivatives behaves as 1/M"~*, where 
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M is a mass scale which depends on the specific theory. Therefore, the effect of 
an n-derivative vertex is of order E"/M"~‘, and, at an energy small compared 
to M, large-n terms have a very small effect. At the lowest energy, only a single 
lagrangian, the one in Eq. (3.3) with two derivatives, is required. We shall call this 
an ‘O(E’)’ contribution in subsequent discussions. The most important correc- 
tions to this involve four derivatives, and are therefore ‘O(E*)’. In practice then, 
the infinity of possible contributions is reduced to only a small number. The coef- 
ficients of these terms are not generally known, and must thus be determined phe- 
nomenologically. However, once fixed by experiment (or by matching to the full 
theory if possible) they can be used to allow predictions to be made for a variety of 
reactions. 


Power counting and loops 


It would appear that loop diagrams could upset the dimensional counting described 
above. This might happen in the calculation of a given loop diagram if, for example, 
two of the momentum factors from an O(E*) lagrangian are involved in the loop 
and are thus proportional to the loop momentum. Integrating over the loop momen- 
tum apparently leaves only two factors of the ‘low’ energy variable. It would there- 
fore seem that for certain loop diagrams, an O(E*) lagrangian could behave as if it 
were O(E”). If this happened, it would be a disaster because arbitrarily high order 
lagrangians would contribute at O(E*) when loops were calculated. As we shall 
show, this does not occur. In fact, the reverse happens. When O(E7) lagrangians 
are used in loops, they contribute to O(E*) or higher. 

Before we give the formal proof of this result, let us note that we saw this 
effect in the linear sigma model calculation above. We started by using the order 
E? lagrangian in the loop diagram and the result was the renormalization of a 
lagrangian at order E*. It is also straightforward to demonstrate why this occurs. 
Consider a pion loop diagram, as in Fig. IV-2. From the explicit form displayed in 
Eq. (3.5), we see that 

(loop) — 1 1 
M+ Oe = a ee Po; P+); (4.4) 


MER 


where 7 is the loop integral with the factor v™^ extracted. Counting powers of 
energy factors is most easily done in dimensional regularization. The loop integral 
contains no dimensional factors other than p+, po, and p’. Since, in four dimen- 
sions it has the overall energy unit E*, it must therefore be expressible as fourth 
order in momentum. Despite the loop integration, the end result is expressed only 
in terms of the external momenta. These momenta are small, and hence all the 
energy factors involved in power counting are taken at low-energy. In dimensional 
regularization, there can also be a dependence on the arbitrary scale u, 
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J dla pu / d'e, (4.5) 


but in the limit d —> 4 this occurs only in dimensionless logarithms such as 
In(E?/u?). Thus, the order of momentum can be found by counting the factors 
of 1/v? which occur for every vertex from the lowest-order lagrangians. Each fac- 
tor of 1/v? must be accompanied by momenta in the numerator in order to produce 
a dimensionless amplitude. Each vertex in a diagram contributes powers of 1/v?, 
and higher-order loop diagrams require more vertices. Thus, every time a loop is 
formed, the overall momentum power of the amplitude must increase rather than 
decrease. 

We have also seen that any divergences present can be handled in the usual way, 
by renormalizations of the parameters in the theory. Again, the uncertainty prin- 
ciple comes into play — the divergences come from the extreme high-energy part 
of the calculation and thus they must look like some term in a local lagrangain. 
If the original effective lagrangian which we have written down is indeed the 
most general one consistent with the given symmetry, then it must have enough 
parameters of the right form to encompass any divergences which occur. In par- 
ticular, our power-counting argument tells us that when £2 is used in one-loop 
diagrams, the divergences are of order E* and should be capable of being absorbed 
into the parameters of that order. Since the parameters are generally unknown and 
are to be determined phenomenologically, the only difference this makes is to 
cast physical results in terms of the renormalized parameters instead of the bare 
ones. 


Weinberg’s power-counting theorem 


To prove this result [We 79b], consider some diagram with a total of Ny vertices. 
Then letting N, be the number of vertices arising from the subset of effective 
lagrangians which contain n derivatives (e.g. N4 is the number of vertices com- 
ing from four-derivative lagrangians), we have Ny = &,N,,. The overall energy 
dimensionality of the coupling constants is thus M‘¢ with 


Nc =} N,4-n), (4.6) 


where M is a mass scale entering into the coefficients of the effective lagrangian 
(e.g., the quantity v in the sigma model). Each pion field comes with a factor of 
1/v, so that associated with Ng external pions and N; internal pion lines is an 
energy factor (1/M)*’*%, (Recall that two pions must be contracted to form an 
internal line.) However, the number of internal lines can be eliminated in terms of 
the number of vertices and loops (N_), 
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N;=Ni+Ny-1=N, +Y N,- 1. (4.7) 
Any remaining dimensional factors must be made up of powers of the energy E 
times a dimensionless factor of E /u where u is the scale employed for renormal- 
izing the coupling constants. (When using dimensional regularization, these factors 
of E/u enter only in logarithms.) Thus the overall matrix element is composed of 
energy factors 


1 


~ Èr Nn(n—4) 
M ~ (M) MNE+2NL +2 doy Nn—2 


E?F(E/p) 


~ (mass or energy) ^E, (4.8) 


where the second line is the overall dimension of an amplitude with Ng external 
bosons. The renormalization scale u can be chosen of the order of E so no large 
factors are present in F(E /u). Overall the energy dimension is then 


D=2+) N,n-—2)+2N;. (4.9) 


A diagram containing Nz; loops contributes at a power E?™L higher than the tree 
diagrams. This theorem is of great practical consequence. At low energy, it allows 
one to work with only small numbers of loops. In particular, at O(E*) only one- 
loop diagrams generated from £ need to be considered. 

The end result is a very simple rule for counting the order of the energy expan- 
sion. The lowest-order (E°) behavior is given by the two-derivative lagrangians 
treated at tree level. There are two sources at the next order (E^): (i) the O(E”) 
one-loop amplitudes, and (ii) the tree-level O(E*) amplitudes. When the coeffi- 
cients of the E* lagrangians are renormalized, finite predictions result. Other effec- 
tive field theories will have power-counting rules analogous to this one appropriate 
for chiral theories. 


The limits of an effective field theory 


The effective field theory of the linear sigma model is valid for energies well below 
the mass of the scalar particle in the theory, the o or S. Once there is enough energy 
to directly excite the S particle, it is clear that the effective theory is inadequate. 
This energy scale is visible even within the effective theory itself. Scattering matrix 
elements are an expansion in the energy, with a schematic form 
2 2 
M~Sli+54..| (4.10) 


and the scale of the energy dependence is determined largely by the scalar mass. As 
the energy increases the corrections to the lowest-order result grow and eventually 
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all terms in the energy expansion become equally important and the effective theory 
breaks down. Thus, the effective theory reveals its own limits. 

In more general effective field theories, there is always a separation of the heavy 
degrees of freedom, which are integrated out from the theory, and the light degrees 
of freedom, which are treated dynamically. In many instances, the natural sepa- 
ration scale is set by a particle’s mass, as in the linear sigma model. We will see 
that in the case of QCD, the meson resonances such as the o(770) do not appear 
explicitly in the low-energy effective theory. Therefore, these have been integrated 
out and help define the limits of the effective field theory. In other cases, we could 
integrate some of the high-momentum modes of certain fields, while still keeping 
the low-momentum modes of these same fields as active dynamical participants in 
the low-energy theory. This is done for the effective hamiltonian for weak decays, 
where we integrate out the high-energy modes of the gluonic fields. In these cases, 
the scale that we have used to separate high and low energy defines the limit of 
validity of the effective field theory. 

Let us also address a rather subtle point concerning the energy scale of the effec- 
tive theory. While we regularly use this idea of an energy scale defining the limit 
of validity of the effective theory, there are times that we do not apply this sepa- 
ration fully. In loop diagrams, if we wanted to only include loop effects below a 
certain energy scale, we would need to employ a cut-off in the loop integral. This 
is often inconvenient and if done carelessly could upset some of the symmetries of 
the theory. Moreover, the presence of an additional dimensional factor in loop dia- 
grams would upset some of the power-counting arguments described above. Most 
often, practical calculations are performed using dimensional regularization. This 
regulator has no knowledge of the energy scale of the theory and thus loop dia- 
grams will in general include effects from energies where the effective theory is 
not valid. However, again the uncertainty principle comes to our rescue. Even if 
these spurious high-energy contributions are not correct, we know that their effect 
is equivalent to a local term in the effective lagrangian. Any mistakes made in the 
loop can be corrected by modifying the coefficients of the terms in the effective 
lagrangian. Careful application of the procedures for matching or measuring the 
parameters will return the the same physical predictions independent of the choice 
of regularization scheme. 


IV-5 Symmetry breaking 


Effective lagrangians can be used not only in the limit of exact symmetry but also 
to analyze the effect of small symmetry breaking. Let us first return to the sigma 
model for an illustration of the method, and then consider the general technique. 


IV-5 Symmetry breaking 125 


The SU(2); x SU(2)r symmetry of the sigma model is explicitly broken if 
the potential V(o, m) is made slightly asymmetric, e.g., by the addition of the 
term 


Loreaking = dao = tt (= T £+) (5.1) 


to the basic lagrangian of Eq. (1.4). To first order in the quantity a, this shifts the 
minimum of the potential to 


a4) + —, 5.2 
v i + TE (5.2) 
and produces a pion mass 
m? = A (5.3) 
v 


Although the latter result can be found by using the linear representation and 
expanding the fields about their vacuum expectation values, it is easier to use the 
exponential representation, 


` 2 
Loresking = FO + S) Tr (U +U’) = F0 + S) Tr (2- (—) +) 


a m? 
=a(0 15) = Ge pe Sa 8) T (5.4) 
v 


The chiral SU (2) symmetry is seen to be slightly broken, but the vectorial SU (2) 
isospin symmetry remains exact. 
As we have seen, the O©(E?) lagrangian is obtained by setting S = 0, 


v? m 
La = z T (3,Ua"U") + Ftv Tr (U + UT). 2) 


Higher-order terms will contain products like 
[m2 Tr (U+U*)], m2 Tr (U +U’) - Tr (a,U9“U'), ..., (5.6) 


and can be obtained by integrating out the field S as was done in Sect. IV-2. It 
is important to realize that the symmetry-breaking sector also has a low-energy 
expansion, with each factor of m? being equivalent to two derivatives. If m2 is 
small, the expansion is a dual expansion in both the energy and the mass. 

If we encounter a theory more general than the sigma model, the effect of a small 
pion mass can be similarly expressed in low orders by, 


Loveing = aim, Tr (U + U) + az [m3 Tr (U + U’) 
+ aym2 Tr (U + U*) Tr (3 UaU) + aqm? Tr [(U + U*)9,U0"U), 
(5.7) 
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with coefficients that are generally not known. An important consideration is the 
symmetry-transformation property of the perturbation. The symmetry-breaking 
term of Eq. (5.1) is not invariant under separate left-handed and right-handed trans- 
formations but only under those with L = R. All the terms in Eq. (5.7) have this 
property. 

Other symmetry breakings can be analyzed in a manner analogous to the treat- 
ment just given of the mass term. One identifies the symmetry-transformation prop- 
erty of the perturbing effect and writes the most general effective lagrangian with 
that property. Most often the perturbation is treated to only first order, but higher- 
order behavior can also be studied. 


IV-6 Matrix elements of currents 


There is an elegant technique which allows one, at a minimal increase in com- 
plexity, to calculate matrix elements of currents from a chiral effective lagrangian 
[GaL 84, 85a]. The idea is to add to the lagrangian terms containing external 
sources coupled to the currents in question. Construction of the effective lagran- 
gian, including source terms, then allows the current matrix elements to be easily 
identified. We shall explain this technique here, and use it extensively in our dis- 
cussion of QCD in subsequent chapters. 

First, consider how current matrix elements are identified in a path-integral 
framework. We have seen in Chap. III (see also App. A) that by adding a source 
coupled to the desired current, matrix elements can be obtained from differentia- 
tion of the path integral, e.g., Eqs. (III-2.2), AII-2.4). For example, we can modify 
three-flavor QCD by adding sources to obtain 


1 =. - 1+ - 1- 
L=- FREE + Wi Dy -Pyu ty -Pyu r" 
— Wi(s +ip)Wr — Urls — ip), (6.1) 


where ¢,,, ru» S, p are 3 x 3 matrix source functions expressible as 
C= o, + LAS, i= r; + rn, s = s} +574, p= p° + pà", (6.2) 


witha = 1,..., 8. The lagrangian in Eq. (6.1) reduces to the usual QCD lagrangian 
in the limit £, = r, = p = 0, s = m, where m is the 3 x 3 quark mass matrix. The 
electromagnetic coupling can be obtained with the choice £, = r, = eQA,, where 
A,, is the photon field and Q is the electric charge operator defined in units of e. 
Various currents can be read off from the lagrangian, such as the left-handed current 
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aL 1+ ys 
00K (x) 2 


= V(x) Yn M(x) (6.3) 


i == 


or the scalar density 


Wa) (x) = (6.4) 


ðs? (x) 
Moreover, matrix elements of these currents can be formed from the path integral 


by taking functional derivatives. The simplest example is 


= .ôlnZ 
(0 |v (x) w(x)| 0) = i—— (6.5) 


ôs? (x) £=r=p=0 i 
s=m 


while other examples appear in Sect. I-2. 


Matrix elements and the effective action 


A low-energy effective action for the Goldstone bosons of QCD will be a functional 
of the external sources. One way to define the connection of the effective action 
with QCD is to consider the effect of the sources, 


ei Wuwus P) — J [dy] [ay] [a Aa] ef fx N (6,6) 


At low-energy, all heavy degrees of freedom can be integrated out and absorbed 
into coefficients in the effective action W. However, the Goldstone bosons propa- 
gate at low-energy, and they must be explicitly taken into account. One then writes 
a representation of the form 


ei W Cuts.) — fiav gif dx Lett(U,Lu,Fu,S,p) (6.7) 


where as usual U contains the Goldstone fields. This form then allows inclusion of 
all low-energy effects while maintaining the symmetries of QCD. 

The lagrangian of Eq. (6.1) has an exact local chiral SU (3) invariance if we 
have the external fields transform in the same way as gauge fields. In particular, the 
transformations 


vi > LY, Yr > R(x)YRr, 

la > LAHU Li) + 18, L(x) LE" (x), 

ra > R(x)r,R* (x) +14, R(x)R* (x), (6.8) 
(s +ip) > L(x\(s +ip)R' (x) 


provide an invariance for any L(x), R(x) in SU(3). 
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In constructing the effective action, these invariances must be included. This is 
easy to do if £, and r, enter in the same way as gauge fields. In particular, upon 
defining a covariant derivative 


D,U = 0,0 +14,0 = Urq, (6.9) 
and field-strength tensors 


Luv = Only — Oe, + ily, Col, 
Ruy = Jury — Oy + trp, rl, (6.10) 


we obtain the following covariant responses to local transformations: 


U > L(x)UR"(x), D,U > L(x)D,U RÌ (x), 


Lug La Lael (x), Ruy > R(x)RwR* (x). old) 


The effective action is then expressed in terms of these quantities. At order E’, 
there are only two terms in the effective lagrangian, 


F2 i e a 
Ly = -$ Tr (D,UD”U*) + Tr (xU + Ux’), (6.12) 
where 
X = 2Bo(s + ip) (6.13) 


and Bo is a constant with the dimension of mass. In the limit £, = r, = p = 0, 
s = m, this is the same effective lagrangian with which we have been dealing in 
the SU (2) examples, with the identification m? = (m, + mq) Bo. Note that this 
usage requires Bo to be positive. 

Having constructed the effective action, we can obtain a number of interesting 
matrix elements. For example, use of Eq. (6.5) provides the identification of the 
vacuum scalar-density matrix element as 


(0 |Wiw;| 0) = — F? Bodi; (6.14) 


to this order in the effective lagrangian. Similarly, use of Eq. (6.3) reveals the left- 
handed current to be 
k F k i 
Ly =i T UO"). (6.15) 
One other advantage of the source method is to allow the use of the equations of 


motion. The standard Noether procedure for identifying currents does not work if 
the equations of motion are employed in the lagrangian. To become convinced of 
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this, one can consider the following exercise. We examine the response of the two 
trial lagrangians, 


Li = Op, L.=—-m’o*o (6.16) 


to a phase transformation yg —> e'%g. The first contributes to the Noether current 
while the second does not. However, these two forms are identical on-shell if g 
satisfies the Klein—Gordon equation. In an effective lagrangian which is meant to 
be used always on-shell it is often convenient to drop terms which vanish by virtue 
of the equations of motion. The use of source fields as described above avoids this 
problem. 


IV-7 Effective field theory of regions of a single field 


In our presentation earlier in this chapter, the construction of an effective field 
theory was described by the integrating out of heavy particles, while leaving the 
light particles as dynamical degrees of freedom. However, often one can make an 
effective field theory from a single particle. In this case, certain energy regions of 
the field are treated as heavy and others are light, and one retains the light regions 
in the effective field theory. Indeed, sometimes there are multiple regions that are 
‘light’ in some sense, and one splits the original single field into multiple fields. 
This section provides some of the background for such decompositions. 

The simplest example of the division of a single field into ‘heavy’ and ‘light’ is 
in the nonrelativistic reduction. When the energy is small, the antiparticle degrees 
of freedom are heavy and can be removed from the theory, leaving a nonrelativistic 
particle description. For example, if one redefines a four-component Dirac field y 
into upper and lower two-component fields, yy, and ye by factoring out the leading 
energy dependence at low-energy via 


(7.1) 


w(x, t) = eint Us (x, a) 


Welx, t) 
w, will behave as a nonrelativistic field and yẹ will account for the two heavy 
degrees of freedom. The free Dirac lagrangian shows this separation, 


L= wip -my 
= Wi + lið + Wm + prio Vet plio -Vyu (7.2) 


While no approximation has yet been made by this redefinition, the nonrelativistic 
limit is taken by assuming that the residual energy dependence is small compared 
to the mass (i.e., one neglects 0, compared to 2m). One can then integrate out the 
lower component through its equation of motion, 
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(id, + 2m) pe © 2m pe = io -VYu, (7.3) 
leaving the upper component as the active non-relativistic degree of freedom. 
Vue) Vu 
L = piid Wu — yi) -Vvu ; (7.4) 
2m 


With inclusion of the interactions, this can lead to a full nontrivial effective field 
theory. A well-developed example of this is the Non-Relativistic QCD (NRQCD) 
effective field theory [CaL 86]. We will also return to this procedure in more gen- 
erality in the discussion of Heavy Quark Effective Theory (HQET) in Chap. XII. 

A second common way of splitting up a single field is to integrate out the high 
momentum portions of a field. This logic is often called Wilsonian [Wi 69]. Imag- 
ine splitting the momenta in a problem into those above an energy scale A and 
those below this scale. By first performing the calculation of the high-energy por- 
tion, one is left with an effective field theory. The operators defining that theory 
will carry factors, the Wilson coefficients, that depend on the scale A. This means 
that one obtains a set of new operators O, in the lagrangian 


L=.. + D Ca (A) On, (1.5) 


where C,,(A) are the Wilson coefficients and the series is infinite. The operators 
are local because they capture high-energy physics, and their matrix elements will 
depend on the separation scale, (O,,) = (O,(A)). One regularly uses the renor- 
malization group to describe the running of the Wilson coefficients with changes 
of scale. The low-energy theory remains a full field theory and one must cal- 
culate the full quantum effects in the matrix elements of ©, up to the scale A. 
When the high-energy physics in C, and the low-energy physics in the matrix ele- 
ments of O, are properly matched, in the end the separation scale A will disap- 
pear from the description. Nevertheless, this separation is often useful. For exam- 
ple, in QCD the high-energy behavior may be reliably calculated in perturbation 
theory, while the low-energy behavior may be best accomplished with lattice cal- 
culations. Examples appearing in this book include the Wilson coefficients of the 
non-leptonic weak hamiltionian, cf. Sect. VIII-3, and those used in QCD sum rules, 
cf. Sect. XI-5. 

In practice, however, we most often do not use a Wilsonian separation scale A, 
but instead employ dimensional regularization. Dimensional regularized loop inte- 
grals do not carry information about any particular scale, and therefore extend over 
all energies. The extension to d < 4 damps the high-energy divergences in a scale- 
independent way. Nonetheless, this procedure works for logarithmically running 
Wilson coefficients. Aside from the momenta, the only scale in a dimensionally 
regularized integral is the u” inserted in front of the loop integral. This ends up 
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Fig. IV-3 The scalar vertex diagram analysed in the text. 


appearing in the final answer as In jz” when expanded close to d = 4. The fact that 
cut-off regularization and dimensional regularization have the equivalence 


1 
In A? s - +In u? (1.6) 


allows the scale u to be a proxy for the separation scale A. However, the correspon- 
dence of u with a Wilsonian separation scale does not hold for Wilson coefficients 
with power-law running [CiDG 00]. 

For a yet more subtle example, consider the interaction of a high-energy massless 
particle in the vertex diagram of Fig. IV-3. For the purposes of this example, let us 
consider these as scalars and the current vertex as J = y*/2. We can analyse the 
resultant scalar vertex integral, 


ao 1 1 1 
(27)! (p +k)? k? (p' + k)?’ 
in the limit where p? ~ p°? < Q? = (p — p’)’. The only scales in this problem 
are Q*, which is treated as a large scale, and p ~ pr which is the small scale. 
The relative size is labeled 47 ~ p?/Q? ~ p°?/Q?. 

This integral can be analyzed by the method of regions [BeS 98, Sm 02].8 In this 
technique, one identifies all the important momentum regions of the loop integral, 


I= 4—d 


‘a (7.7) 


and makes appropriate approximations within each region. A portion of the integral 
will have all the components of the loop momenta of order Q and higher. This 
will be called the hard region. A region labeled soft has all the components much 
smaller than Q. In addition, there will be regions where the momentum is of order 
Q in the direction of p or p’. In these collinear regions, some invariant products 
can be smaller than Q?. 

In order to quantify this one takes light-like reference four-vectors 


n” = (1,0,0,1), A“ = (1,0,0,—1) n?=f7?=0, n-ñ=2. (7.8) 


For an arbitrary four-vector expressed using these and a transverse component, 


ji n” n” i n” n't ji 9 
Vi =n-V—-+n-V Vi =v. VL Var 7. 

my +n 5 + £5 + 7 + (7.9) 
the invariant product is 


8 This example and the treatment of it follows the lectures of [Be 10]. 
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V? = (n: V). V) + V? = V V+ VE. 
1 
A,B} = g eB A A By (7.10) 


These are useful because we can choose a frame with p along n and with p’ along 
n, and we can refer to the n direction as ‘right’ and the n direction as ‘left’. This 
allows us to classify the different regions. Of the original momenta, we have 


(V+, Veg Vi) 
p~oO?, 1, 0 Q 
p~, D7; 0) Q. (7.11) 
Q~, 1, 0Q 


Q is a hard momentum because it takes a hard interaction to change an energetic 
right-moving particle into one moving left. Using this decomposition, one can iden- 
tify the regions of the loop momentum 


(k+, k_, k1) 
k~(1, 1, 1)Q hard 
k~(A?, 1, A) Q collinear R. (7.12) 


k~(1, 47, 4) Q collinear L 
k ~(A7,47,47) Q soft 
In each region, one can drop small momentum components in terms of large 


ones. For example, when k is in the hard region, one can drop p’, pA k- p+, 
ką p_, which are all of order 7, in order to obtain? 


aa Tak 1 
Thard = M d A . 2 . 2 F . 
(270)? (k2 +i (k? +k_py +ie(k? +k, p' +ie) 
ira 1 1 2 1 2 E 
ER) ho at e E (7.13) 
(4r)! Q? |@e € -Q? 2 -Q? 6 


Similarly, in the right collinear region, one can expand (k + p’)? = k_ p’. + OA’), 
such that 
4a [f d'k 1 
kor = H A TIZES 7: 7 : 
(27)* (k* + ie)((k + p)* + te) (k_ py + ie) 
iT 1 1 ‘ 1 z 
| 2 In? É += 


= (4x )4/2 Q? 
An observation that will be relevant for the eventual construction of an effective 
theory is that when the exchanged propagator carrying momentum k is in the right 


—-In + 


— 7.14 
E2 č — p’? 2 — p’? ( ) 


9 The integrals of this section are displayed in the useful appendix of [Sm 02]. See also [Sm 12]. 
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collinear region, the other propagator on the p side is also collinear, but the third 
propagator on the p’ side is hard. A similar result is obviously found when k is in 
the left collinear region, obtained by replacing p by p’. Finally, in the soft region, 
one keeps only terms of order A”, finding 


4a f d'k il 
(27)! (k? + ie) (k-p+ + p? + ie) (ky pl + p? + ie) 
iT(_dite) [1 1, w@ Le CO m? 
= In In + 
(4x )4/? Q? e2 € —p? p? 2 —p? p? 6 
If one tries to identify other regions besides these and makes the correspond- 
ing simplifications of the loop integral, one ends up with a scale-less integral 
which vanishes within dimensional regularization. For example, if one considers 


the region where k scales as k ~ (A?, 47, A)'°, one would use k? ~ kî and keep 
terms of order à? in each propagator 


Toft = 


(7.15) 


d¢k 1 


— 
(27) (ki + ie) (k-p} + p+ ki tie) (ke pl + p? + ki + ie) 


1 | dik 1 
pip. J (27)? (ki + ie) (kL + ie) (ki, + ie) 
=0, (1.16) 


where in the second line we have defined shifted variables k = k_+(p?+k7)/p4 
and ki, = k4 + (p' 2 + k+)/p!, with the result being an integral without any scale. 
Such integrals are set to zero within dimensional regularization. 

The sum of the four subregions yields the correct total integral, 


. 2 

jac [h Oin +E], (7.17) 
167202 p p 3 

up to terms suppressed by powers of à. As expected, this result is finite, even 

though the integrals from the individual regions are not. The approximations that 

we made lead to infrared divergences in the hard integral, and ultraviolet diver- 

gences in the others. However, these cancel when added together. 

The other interesting feature of this procedure is that we have not restricted the 
integration ranges when calculating the integrals for the different regions. The full 
integration range is used in each case. The reason that this does not amount to 
double counting within dimensional regularization is that if there is a single unique 
scale within the integral, as has been deliberately constructed in each region, the 
integral is determined by momenta around that scale. This is the key observation 


10 This region is referred to as the Glauber region. The treatment of the integral given in the text appears 
adequate for this example, although the understanding of the Glauber region is still evolving [BaLO 11]. 
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that allows the method of regions to work. By constructing approximations that 
scale in unique fashions, one can isolate the physics of that region alone.!! That 
this actually happens in these integrals can be seen from the above integral where 
the factors of Q?, p?, and p*p’/Q? all signal the dominant scale in the respective 
diagrams, showing that the effects come from different regions of the momentum 
integration. 

One can convert the analysis of the method of regions into an effective field 
theory whose applicability extends beyond this particular example. The initial field 
can be divided up into new effective fields for each of the important regions. The 
goal is to choose these fields and their interactions to yield the same results as 
the method of regions analysis outlined above. The hard-momentum region can 
be integrated out completely and replaced by effective operators of the light fields. 
These operators will come with Wilson coefficients to ensure the matching with the 
full calculation. However, the dynamical light fields need to come in three varieties 
for the different light-momentum regions. Thus, the original scalar field g(x) now 
comes in three varieties, o (x) = per (x) + peL (x) + G(x). The interactions of the 
light fields among themselves is relatively simple to construct. If the interaction 
vertex of the original theory was a simple gy? vertex, we expand that to include the 
possible interaction between the light fields, 

E 3 


E E 
Pir + ZPeL t 3Ps + BPRS + EPEL: (7.18) 


a a -3 
Vertices not listed above, such as ge RO? are ones which cannot occur due to 
momentum conservation (e.g., a collinear particle cannot split into two soft par- 
ticles). 

It is somewhat more subtle to choose the other effective operators and their 
Wilson coefficients. For the scalar example shown above, the ‘current’ carrying the 
momentum Q in the full theory is J = y7/2. Since it transfers this large momen- 
tum it can connect er to Pez such that we expect a vertex J ~ PerRQeL. However, 
in addition we need to recall that we have integrated out the hard scalars. This leads 
to additional vertices. For example, in the diagram of Fig. IV—4(a) the propagator 
is hard because it carries the momenta of both left-moving and right-moving fields, 
which couple to it at the lower vertex. When the other fields are light, this propa- 
gator shrinks to a point vertex as in Fig. [V-4(b). This, then, is a new contribution 
to the current operator, and we expect that the current has the form 


J = CPeRPeL + C30 pPeL F Ci perp apt (7.19) 


11 Tn cases where regions are defined which have overlapping contributions there are also methods for cleanly 
separating the regions [MaS 07]. 
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P /k p' P/k p' 
(a) (b) 


Fig. IV—4 (a) An interaction of collinear particles through a hard propagator; (b) 
the effective local vertex representing this interaction at low-energy. 


where Cz and C3 are the Wilson coefficients. Calculation from the original theory 
shows that to this order 


2g 


2 
— ey C} z 
kip. —ie 


Co = 1 + 8° Ihara, C3 3 = > 
kąp-— ie 


; (7.20) 
where Ihara refers back to Eq. (7.13). 

At this stage, we can reproduce the original vertex calculation using the effective 
theory by the calculation of the diagrams of Fig. IV-5. The diagrams of Fig. IV-5 
(a),(b),(c) refer to the new vertices given in Eq. (7.20), while Fig. IV-5 (d) refers to 
the soft contribution of Eq. (7.15). By construction, one can see how all four of the 
regions of the original diagram are reproduced. We note how the hard propagators 
that occur when k is in one of the collinear regions have been accounted for by a 
new local vertex in the current operator, with the Wilson coefficient describing the 
effect of the hard propagator. 

The reader may object that the construction of the effective theory was more 
trouble than evaluating the original diagram. However, once we have developed 
the effective theory, we can apply it in multiple new contexts. The example above 
is analogous to the Soft Collinear Effective Theory (SCET) of QCD [BaFPS 01]. 
Similar techniques are used in the various realizations of NRQCD [CaL 86, PiS 
98, BrPSV 05]. Outside of the Standard Model, related methods are applied in 
the classical effective field theory of General Relativity [GoR 06], which has been 
used to systematize the classical treatment of gravitational radiation from binary 
systems [PoRR 11]. Further development of the method of regions and threshold 
expansions can be found in [BeS 98, Sm 02]. 


Fig. IV-5 The diagrams involving the light fields reconstructing the scalar vertex. 
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(a) (b) 


Fig. 1V-6 Photon amplitudes containing a single fermion loop. 


IV-8 Effective lagrangians in QED 


We have explored in some detail the structure of effective field theory by using 
chiral symmetry as an example. However, this is not meant to imply that effective 
lagrangians are useful only in that one context. In fact, they can be applied to a 
wide variety of situations. Here, we apply the technique to QED. 

Consider situations in which the photon’s four-momentum is small compared to 
the electron mass. In such cases, the electron and other fermions cannot be pro- 
duced directly, but instead influence the physics of photons only through virtual 
processes. The lowest-order diagrams, i.e., those which contain a single electron 
loop, with increasing numbers of external photon legs, are shown in Fig. IV—6. 
Note that the one-loop diagram containing three photons, or indeed any odd num- 
ber of photons, vanishes by virtue of charge-conjugation invariance. This is true 
to all orders in the coupling e, and is refered to as Furry’s theorem. Diagrams like 
those in Fig. IV—6 have effects at low-energy which are typically calculated in per- 
turbation theory. The associated amplitudes have coefficients which scale as some 
power of the inverse electron mass. They can be generated by means of an effective 
lagrangian, as we shall now discuss. 

Let us seek a description which eliminates the electron degrees of freedom. That 
is, we wish to write a lagrangian which involves only photons, but nevertheless 
includes effects like the ones in Fig. IV—6. The result must of course be gauge- 
invariant. The procedure may be defined by the path-integral relation 


J [dA,,| exp f l d‘x Lay) 


_ f [dAu lav dw] exp [i f d*x Loep(Au. W, W)| 
7 Sldwl[dv] exp[i fd4x Loy, »)] 


(8.1) 


where Lorp is the full QED lagrangian, and Lp is the free fermion lagrangian. 
Thus Ler has precisely the same matrix elements for photons as does the full 
QED theory. Specifically, it includes the virtual effects of electrons. The techniques 
described in App. A-5 enable us to formally express the content of Eq. (8.1) as 
[Sc 51] 
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[as Les(A,) = -i J dx Fy, FY — iTr In É = . BED 
This form, although formally correct, does not readily lend itself to physical inter- 
pretation. However, we can determine various interesting effects directly from per- 
turbation theory. For example, the vacuum polarization of Fig. IV—6(a) modifies 
the photon propagator, i.e., the two-point function. From Eqs. (II-1.26), (II-1.29), 
we determine the result for a photon of momentum q to be 


A . a 2\ 7 : 

iwa) = iggy (Gud — Bu") Ha Ho (8.3) 
The essence of the effective lagrangian approach is to represent such information 
as the matrix element of a local lagrangian. In the present example, we find that the 
term in Eq. (8.3) corresponds to the interaction 


Lett Payor. (8.4) 


602m? 
where O = 0,0". 

The calculation of Fig. [V—6(b) is a somewhat more difficult, but still straightfor- 
ward, exercise in perturbation theory. We shall lead the reader through a calculation 
using path integrals in a problem at the end of this chapter. It too can be represented 
as a local lagrangian, and is usually named after Euler and Heisenberg [ItZ 80]. One 
finds the full result to one-loop order to be 


1 a 
LA == lak +> wr 
BE gE E onma ” 
a 2 7 ~ 2 
+ | (FP) +5 (Fak) J à (8.5) 
where Fu» = 5€nvap F°’. Corrections to this effective lagrangian can be of two 


forms: (i) even at one loop there are additional terms of higher dimension 


2 


o v Oo v Q 1 v 3 
Fug F”, Fov Fl Fog F P, (Eno F?) gaias (8.6) 


ms 
involving either more fields or more derivatives; or (ii) the coefficients of these 
operators can receive corrections of higher order in œ through multi-loop diagrams. 
We see here an example of the energy expansion, which we have discussed at 
length earlier in this chapter. In this case it is an expansion in powers of q? /m?. 
The effective lagrangian of Eq. (8.5) can be used to compute aspects of low-energy 
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photon physics such as the low-energy contribution of the vacuum polarization 
process or the matrix element for photon—photon scattering. 


IV-9 Effective lagrangians as probes of New Physics 


One of the most common and important uses of effective lagrangians is to param- 
eterize how new physics at high energy may influence low-energy observables. 
The general procedure can be abstracted from our earlier discussion. Remember 
that one is trying to represent the low-energy effects from a ‘heavy’ sector of the 
theory. This is accomplished by employing an effective lagrangian 


Lett = > Cn On, (9.1) 


where the {O,} are local operators having the symmetries of the theory and are 
constructed from fields that describe physics at low-energy. There need be no 
restriction to renormalizable combinations of fields. Most often the operators can 
be organized by dimension. The lagrangian itself has mass dimension 4, so that if 
an operator has dimension d; the coefficient must have mass dimension 


Ch ~ M, (9.2) 


The mass scale M is associated with the heavy sector of the theory. It is clear that 
operators of high dimension will be suppressed by powers of the heavy mass. To 
leading order, this allows one to keep a small set of operators. 

Some applications will involve phenomena for which the dynamics is well under- 
stood. If so, the coefficients of the effective lagrangian can be determined through 
direct calculation as in the preceding sections. Other examples occur in the the- 
ory of weak nonleptonic interactions (cf. Sect. VIII-3) and in the interactions of 
W-bosons (cf. Sect. XVI-3). Even more generally, effective lagrangians can also 
be used to describe the effects of new types of interactions. In these cases, dimen- 
sional analysis supplies an estimate for the magnitude of the energy scales of pos- 
sible New Physics. We shall conclude this section by using effective lagrangians 
to characterize the size of possible violations of some of the symmetries of the 
Standard Model. 

Given certain input parameters, the Standard Model is a closed, self-consistent 
description of physics up to at least the mass of the Z°, and is described by the 
most general renormalizable lagrangian consistent with the underlying gauge sym- 
metries. What would happen if there were new interactions having an intrinsic 
energy scale of several TeV or beyond? In general, such new theories would be 
expected to modify predictions of the Standard Model. The modifications would 
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be described by non-renormalizable interactions, organized by dimension in an 
effective lagrangian description as 


Lee = Lsm + ics + = fe E (9.3) 


where £,, has mass dimension n and A is the energy scale of the new interaction. 

There is a single operator of dimension 5 which will be displayed in the neu- 
trino chapter. At dimension 6, there are 80 distinct operators consistent with the 
gauge symmetries of the Standard Model [BuW 86]. These can generate a variety 
of effects which deviate from the Standard Model. For example, the operator 


f € v 
Ls (c') = mi D)W W”, (9.4) 


containing the Higgs field ® and the field tensor W,,, of SU (2) gauge bosons, 
produces a deviation from unity in the rho-parameter, !? 

My, v? 
— =l- -dm H. 9.5 
M3 cos? Ow A2 (Aa 


p 


The current level of precision, |o — 1| < 0.0029 (at 20), requires A > 4.5 TeV for 
c’ = 1. Yet another possibility concerns the violation of flavor symmetries in the 
Standard Model. The operator, 


c" 


Le(c") = Kae Yall + ys)u Sy” (1 + ys)d + h.c., (9.6) 


conserves generational or aN number, but violates the separate lepton-number 
symmetries. It leads to the transition Kz —> e~ u™ such that 


Dko te ç" i 
= zh" (9.7) 
Tkt utv VusG p A 


The present bound, Brko ye < 4.7 x 10°! at 90% confidence level, requires 
A > 1700 TeV for c” ~ 1. In a similar manner, constraints on other physical 
processes imply bounds on their corresponding energy scales A, generally in the 
range 5 —> 5000 TeV. 

Dimension-six contact interactions also are searched for at the highest energies 
of the Large Hadron Collider (LHC). The effect of the contact interaction becomes 
relatively more pronounced at high energy when competing with background pro- 
cesses which fall off due to propagator effects. For example, an operator leading to 


qq > Wu, 


2 


a 
Lele) = 5K21 WIL ALY ML, (9.8) 


12 More precisely the comparison is with a form of the rho-parameter after radiative corrections. 
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becomes increasingly visible over the Drell-Yan process at high energy. Early LHC 
results [Aa et al. (ATLAS collab.) 11] bound this interaction with A > 4.5 TeV 
at 95% confidence for g?/4z = 1; such limits will clearly improve in the future. 
Interestingly, some operators are better bounded by low-energy precision experi- 
ments and others are better probed at high energy [Bh et al. 12], demonstrating the 
value of both lines of research. 

Of course, if there is new physics in the TeV energy range, it need not generate 
all 80 possible effective interactions. The ones actually appearing would depend 
on the couplings and symmetries of the new theory. In addition, the coefficients 
of contributing operators could contain small coupling constants or mixing angles, 
diminishing their effects at low-energy. However, the effective lagrangian analysis 
indicates that the continued success of the Standard Model is quite nontrivial and 
places meaningful bounds on possible new dynamical structures occurring at TeV, 
and even higher, energy scales. 


Problems 
(1) U(A) effective lagrangian 
Consider a theory with a complex scalar field g with a U (1) global symmetry 
o — ọ' = exp (i0) g. The lagrangian will be 
L = 3 p“ 3p + gy — Ag"). 
(a) Minimize the potential to find the ground state and write out the lagrangian 
in the basis 
=! 
p — af 2. 
Show that øz is the Goldstone boson. 
(b) Use this lagrangian to calculate the low-energy scattering of g2 + g2 —> 


2 + 2. Show that despite the nonderivative interactions of the lagrangian, 
cancelations occur such that leading scattering amplitude starts at order p*. 


(v + Q(x) + i2(x)) 


(c) Instead of the basis above, express the lagrangian using an exponential 
basis, 
1 
= —; 
/2 
Show that in this basis a ‘shift symmetry’ x —> x + c is manifest. 
(d) Calculate the same scattering amplitude using this basis and show that the 


results agree. Note that the fact that the amplitude is of order p* is more 
readily apparent in this basis. 


(v + (xei xO., 
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(2) Path integrals and the Fermi effective lagrangian 


(3 


wa 


Consider the path integral Zw = f [dW*t][dW~]exp|i f d*xLw(x)], where 
Ly is the W*-boson lagrangian Ly = LY? + £0, with 


1 

(free) v 2 Cnt) __ §2 

cites) = a hie Mew we, ce de (Wy Jh + hic.) . 

Integrating out the heavy W= fields in Zw leads to an effective interaction 

between charged weak currents called the Fermi model. 

(a) Show that, upon discarding a total derivative term, one can write the free 
field contribution in Zw as 


l dix ci) = J d'x d*y Wła K” (œ, y)W7 O), 


where Kyy(x, y) = d(x — y) [g (8? + My) — 3L]. 
(b) Further steps allow the path integral to be expressed as 


2 
Zw = exp [~ fas d*y JË Awa, DOLE 


where A(x, y) is the Fourier transform of the W~ propagator A,,,(k) = 
— (guv — kyky / My). Upon expanding this form of Zw in powers of My, 
show that to lowest order, 


LEP œ) = E JE ŒJ) (Fermi model), 
where the Fermi constant obeys G p / i= 85 zi (8M7) 
The Euler-Heisenberg lagrangian: constant ma field 
Consider a charged scalar field ọ interacting with a constant external magnetic 
field B = Bk. The corresponding Klein—Gordon equation is (D? +m?) (x) = 
0, where D, = 0,, + ieA,, is the covariant derivative, and the effective action 
is then given by 


fidoi Ghee OO eee) 
fide @\ldg*(Gx)el fax PO +m ee) 
= det(O? + m”)/det(D? + m°’), 

D? +m? 

D+ mm 


ei Sett(B) — 


Sere(B) = i Tr In 


The operation ‘Tr In’ applied to a differential operator is not a trivial one and 
the purpose of this problem is to evaluate this quantity for the case at hand. 
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(a) Demonstrate that 
[0,6] 5 2 
SaB) = iT: f ens (e De ane 
0 


(b) In order to evaluate the trace we require a complete set of solutions to the 
equations 


Dg, (x, y, Z, t) = AnQn(x, y, Z, t), 
Dg, (x, Yy, Z, t) = Kn Qn (x, y, Z, t), 


so that we may write 
SAS 2 
S (B) =i / eae e" Ss eo kns _ eons . 
=D [Se ew em 


(c) With the gauge choice A, = (0, Bxj ) show that the eigenstates are 


g(x, yz t) = e! Exx thy y thee kt) 
p(x y. Z t) = eil kztkyy—kit) pn (x—ky/eB) 


where vy, (x) is an eigenstate of the harmonic-oscillator hamiltonian, and 
the eigenvalues are kn = —k? +k? +k? +k2, An = —k? +k? +eB(2n + 1). 


(d) Rotate to euclidean space and evaluate the trace using box quantization. 
Taking a box with sides L1, L2, L3 and a time interval T, we have 
PO dtk 
K : => LiLaLaT | Paar 
2 -œ (27) 
ve dkodk, 
A: > LoL3T dk 
2 A J Ji my 
where the integration on k, is over all values with x’ = x —k,/eB positive. 
(e) Evaluate the effective action 
Ser(B) = LyLoL3T = n Zr dkod ka (nt +45 +10 
co (27)? 
x eB Şo ereBanes -f dk,dky eT ke+k)s 
an “= (Qn) 


and show that 


Seg(B) = Ly LLT 1 [ dS 2%. eBs 1 
— e ———— —1]. 
i a” 16x? s? sinh e Bs 


Expand this in powers of B, finding the (divergent) wavefunction renor- 
malization and the B4 piece of the Euler-Heisenberg lagrangian. 
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(f) Show that the corresponding result of a constant electric field can be found 
by the substitution B — iE so that 


Gop ey an ee [Ss s S _y 
eE TES" Tom? Jo s i sine Es l 


(g) Demonstrate that, although Im Se (B) = 0, one nonetheless obtains 


e2? E2 as (—)” 
Im SelE) = Li LaLa T Y 
IT 


eonnm /eE f 


and discuss the meaning of this result [Sc 51, Sc 54]. 


V 
Charged leptons 


From the viewpoint of probing the basic structure of the Standard Model, the 
charged leptons constitute an attractive starting point. Since effects of the strong 
interaction are generally either absent or else play a secondary role, the theoretical 
analysis is relatively clean. Moreover, a great deal of high-quality data has been 
amassed involving these particles. Thus, charged leptons serve as an ideal system 
for defining our renormalization prescription, and for investigating the effects of 
various radiative corrections. 


V-1 The electron 


Some of the most precise tests of the Standard Model (or more exactly of QED) 
occur within the elementary electron—proton system. The renormalization program 
for the theory has been introduced in Sect. II-1, where it was shown how ultraviolet 
divergent contributions to such calculations can be removed by means of subtrac- 
tion from a finite number of suitably constructed counterterms. Here we examine 
the finite pieces which remain after such subtractions and compare theory with 
experiment. 


Breit-Fermi interaction 


The electromagnetic properties of the electron are studied by use of a photon probe. 
To lowest order, the gey vertex has the structure 


(e(De, AL) Jile Pe, Ae)) = —e U, ALY“ Ue, Ae), (1.1) 


and the interaction between two charged particles is governed by the exchange of 
a single virtual photon. An important example is the electron—proton interaction, 
which has the invariant amplitude! 


1 We work temporarily with an ‘ideal’ proton — a point particle having no anomalous magnetic moment. 
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= 1_ j 
Mep = e’u(p,, M)y"u(pe, re) aE Pp: Àp)Yuu (Pp, Àp), (1:2) 


where pe, p, and pp, P, are respectively electron and proton momenta and q = 
Pe — p, is the four-momentum transfer. In the following, we shall demonstrate 
how the above single-photon exchange amplitude is associated with well-known 
contributions in atomic physics. Denoting proton two-spinors with tildes, we begin 
by reducing the amplitude of Eq. (1.2) in the small-momentum limit to 


2 2 12 24 12 
Mer -S [1-H APE) re 


2 2 
8m? 8m; 


z -Pp tio - p’, x ‘.p.+io-p’ A 
«ferfit 2 2 Pel xt[i + a Pe, 


2 2 
4m; 4m% 


-Pp +P, — io x (Pp Pp) „Pe +pL— io x (Pe — pi) 
Ae e ee aS , (1.3) 
Mp 2Me 


where me, m „ are, respectively, the electron and proton masses. The various terms 
in the above expression can be interpreted physically by recalling that in Born 
approximation the transition amplitude and interaction potential are Fourier trans- 
forms of each other, 


d°q . 
Ve E HAE M eps 1.4 
p(r) J o P (1.4) 
where r = r, — rp. From the relation 
OG iarl 1 
A a (1.5) 
(27)? q? 4nr 
we recognize the leading (velocity-independent) term, 
Bats. fh 
Vcou = =z XH XX, (1.6) 
Tr 
as the Coulomb interaction between electron and proton. The identity 
=] i Lal US ene SE) (1.7) 
(27)? q? 4m? 4m? 4rr? 


allows us to recognize an additional piece of Eq. (1.3) as the spin-orbit potential, 
which is often expressed as 


1 1dVo sra a O 
TANN) tz + 


—-rXP x, (1.8) 


Vso = = - 
2m2r dr CEE g 
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but evaluated in this instance with Vọ = —e’/4zr. Combining the remaining 
O(p?/m2) terms in Eq. (1.3), we can cancel the q? term in the denominator to 
obtain the so-called Darwin potential, 


e 


2 
Vp ee ab x x (1.9) 


8m 
This term has its origin in the electric interaction between the particles, and by 
employing the Gauss’ law relation, 


V. Ecoul = e8® (r), (1.10) 


it can be re-expressed in the equivalent form 


e 


gpa V Ecou Z" xx. (1.11) 
ms 


Vp 


The spin-orbit and Darwin potentials, together with the © (p?°/m?) relativistic cor- 
rections to the electron kinetic energy, give rise to atomic fine structure energy 
effects. 

The remaining terms in the photon exchange interaction of Eq. (1.3) are effects 
produced by electron and proton current densities, the terms (pe + p/,)/2m,. and 
—io x (pe — p,,)/2m-_ representing convection and magnetization contributions, 
respectively. In particular, the interaction between magnetization densities is equiv- 
alent to the dipole-dipole potential 


e „0 a) ee 1 
Vaple—dple = 5 ae -V x ( a x x vz). (1.12) 


e 
Mp 2 4rr 

Recognizing that the magnetic field produced by the magnetic dipole moment of a 
(point) proton is 


1 
ne E v) (1.13) 


e 
Byroton = V x | — X 
prore (< A Air 


p 
we can interpret the hyperfine energy as the interaction between the electron mag- 
netic moment and the spin-induced proton magnetic field. Upon dropping the pro- 
ton and electron spinors and using the identity 


1 4r 
7 È + Taom) , (1.14) 


the dipole-dipole interaction may be written as a sum of hyperfine and tensor 
terms, 
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Table V-1. Precision tests of QED. 


Experiment Theory 


vb 1420.405 751 767(1) 1420.403(1) 

af 1159.65218076(27) 1159.65218178(77) 

al 1165.92089(54)(33) 1165.91802(2) (42) (26) 
AEP 1057845.0(9.0) 1057844.4(1.8) 

a x10 


Tn units of kHz. 


Vaple—dple = Vhyp + Viensor, 


8TA 
pe - $80), 1.15 
hyp 3memp Se Spd" AN) ( ) 
a A A 
Viensor = mmr? [3(se 4 r)(S, E) = Sgt sp] : 
eM p 


Denoting the total electron—proton spin as Stot = Se + Sp, it follows that the hyper- 
fine interaction splits the hydrogen atom ground state into components with Stot = 1 
and Stot = 0. The frequency associated with this splitting is one of the most pre- 
cisely measured constants in physics and is the source of the famous 21-cm radi- 
ation of radioastronomy. As seen in Table V—1, the experimental determination is 
about six orders of magnitude more precise than the theoretical value. Precision in 
the latter is limited by the nuclear force contribution (about 3 parts in 10°). 

Let us gather all the terms discussed thus far. In addition, we treat the proton 
and electron on an equal footing, since it will prove instructive when we discuss 
models of quark interactions in Chaps. XI-XIII. We then obtain the full one-photon 
exchange potential (Breit—Fermi interaction) for the electron—proton system, 


a 87a Ta 1 1 
= (3) 3 
Vone—photon = = F ama (r)Se - Sp + EA (r) È + x 
3(Se -T)(S,-T) —Se-S, 
nope aa ed p] 
M Se- r X Pe Sp EBs Txor 
r? 2m? 2m‘, MeM p 
e* PÊ. e)* Pp|> 1.16 
2MeM pr [p Pp + EE Pe) Pp] (1.16) 


where we recall r = re — r, and note that a spin-independent orbit—orbit interac- 
tion has been included as the final term. The single-photon exchange interaction is 
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seen to include a remarkable range of effects, all of which are necessary to under- 
stand details of atomic spectra. 


QED corrections 


Also important in precision tests of atomic systems are the higher order QED 
corrections. We have just demonstrated how the simple q? piece of the photon 
propagator leads to the Breit—Fermi interaction between electron and proton. The 
vacuum polarization correction discussed in Sect. IT-1 produces an additional com- 
ponent of the e— P interaction called the Uehling potential. From Eq. (II-1.37), we 
recall that in the on-shell renormalization scheme the subtracted vacuum polariza- 
tion TI behaves in the small-momentum limit m? >> q? as 


ce e? q? q! 
TI(g) = O 
ea? ae (4) 


By the process of Fig. II—3, this yields the contribution 


Bq ar © eg 40 4 
Vuehling (r) = Oni = q? x 60x2? m2 = 15 we ~ (r). (1.17) 


The presence of the delta function implies that S-wave states of the hydrogen atom 
are shifted by this potential while other partial waves are not. Contributions from 
the Uehling potential have been observed in scattering experiments despite its O(a) 
suppression relative to the dominant Coulomb scattering [Ve et al. 89]. 

The photon-electron vertex is also affected by radiative corrections. Let us write 
the proper (1 PJ) electron—photon vertex through first order in a as 


ie s (Po, Pe) = iey, + ieAy(Dp,, Pe) + °°", (1.18) 
where, referring to Fig. II-2(b), we have in Feynman gauge 


d*k —ig? 
2n)* k? — 1? + ic 

L L 
Bokom p,—k—m. +ie!” 


ie, (p), pe) = (ie) / 


x Vy (1.19) 


Note that a small photon mass à has been inserted to act as a cut-off in the small- 
momentum domain, and we take both incoming and outgoing electrons to obey 
p? = p? = m2. With a modest effort, the integral in Eq. (1.19) can be continued 
to d spacetime dimensions, 
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d¢k 
ied, (po, pe) = Gey in” fa fo 


n (2e — KYK + 4K (Pe + Po), — 4Meky — 4 (Pe + PL) ky +4pe - Peo 
[k — 22 + ie] [(k — py)? — p2 + ief 


(1.20) 


where py = xpe + (1 — x) p’, and the result of performing the k-integration can be 
expressed as 


ies (Po Pe) = Mi) + (h)v, (1.21) 


where (Z1), is singular in the € — 0 limit, 


o, TO) — a 
Hih = ih Go (4) - é [ ax | av 


[y> p2 + a —y)] 


(1.22) 
and (h), is not, 
= e N, 
(2) =i 2 = e+] 
(47)? (Arr) pzt+2d—y)] 
N, = y` (2e = 2) Px Yv Px T 4y? ies F p. = ~ Me(Px)v B (De Eg p.) ` Py] 
+ 4ype > Pp Yv- (1.23) 


The singular term (J;),, which arises from the Ky, term in Eq. (1.20), is infrared- 
finite, and thus the photon mass A can be dropped from it. Upon expanding (Z1), in 


powers of € and performing the y-integral, we obtain 
e fi p? 

(L) = iey —— | -+In(47)-y-1 -[ dx In + O(e)|. (1.24) 
167r? | € 0 u? 


Because (J2), is not multiplied by any quantity which is singular in €, we can 
immediately take the € — 0 limit to cast it in the form 


e? 1 1 N, 
Ih), = -i—= d d , 
= ee | -f E SEE 


N, = —2y? Pr Yy Px + 4ype- Pov (1.25) 
+4y*[(pe + pl), Px — Me(Dr)v — (Pe + PL) Payo]. 


The photon mass à can be dropped from the terms in N, proportional to y? and y? 
since they are nonsingular even if A = 0. Performing the y-integration then yields 
the result, 
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3 1 
(h), = “tas | dx py? (—PxWw Px +2pe Poy» In(p?/A) 
0 
+4 [pe gg Pov Px z Me(Px)v ~~ (De F A) ` PxYv]) : (1.26) 
The identities 
Px¥v Px =2Me(Px)v — DoW» (Pe + pl) Py =2m2—q?/2, (1.27) 


P =m, —q?x(1—x) 


allow (J), to be expressed in terms of q? = (pe — p’,)”, and the dependence of p? 
on the symmetric combination x(1 — x) implies 


1 1 1 
i dx (Px) f (P?) = 3 (pe + o), f dx f(p?). (1.28) 
0 0 


These steps, plus use of the Gordon decomposition of Eq. (C-2.8) finally lead to 
the expression 


4r? | 4e 4 


1 1 2d _ 1 1 3 2 o2 
-3f asin fee he f a Ba 
4 Jo u 2 0 m4 — q°x(1 — x) 


2m -q fa 1 In m? — q?x(1—x) 
4 0 m2 — q?x(1—x) à? 


i B o2 1 2 
io pq’ e m4 
=jel= a 1.29 

e| 2m, 8T? Jo rao an 


2 1 2: — In(4 
toler) Her |1 + f | i EN 


In the on-shell renormalization program, the electron—photon vertex 
ier (p, pe) = ieZP T, (pL, pe) (1.30) 
is constrained to obey 
lim ier” (p, Pe) = ieyy, (1.31) 


so that 


Ges e fi y —ln(4m) 1 m? 1 m? 
Zi  =1-—|-—+1- — -In — -ln 
47°? | 4e 4 4 u? 2 22 


2 — In(4 1 2 1 Z 
= 7 MS) _ e 1— Y n( T) — In Me — In Me s (1.32) 
1 Arr? 4 4 u? 2 2? 


The on-shell renormalized vertex is thus given by 


; B 
ier (p,, pe) = ie (wr (P) -L r, ()) , (133) 


2Me 
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where F\(q7) is given by a complicated expression which we do not reproduce 
here, and 


r e f m 
F- = — dx £ ʻ 1.34 
(4°) al m? — q?x(1 — x) ae) 
In addition to its original spin structure y,, the electromagnetic vertex is seen in 
Eq. (1.33) to have picked up a contribution proportional to o,gq°. The y, and oygq’ 
contributions are called the Dirac and Pauli terms respectively, and F(q?) and 
F,(q°) are the Dirac and Pauli form factors of the electron. The vertex correction 
turns out to have several important experimental consequences. 
Consider the interaction of an electron with a classical electromagnetic field for 
very small q?. Using Eq. (1.33) and the Gordon identity, we have 


Him = eA, (x) (eP) Jem (2) le (Pe) 
+ vB 2 hee 
= —eA,(x)u(p,) r = — <3 u(pe)e'4* + O (q’) 
— =! (De ar pa)” io”fqg e —ig-x 2 
= —eA,(x)u(p)) | e ° (1 + Z) u(pe)e +0 (4°). 


(1.35) 


Precision tests of QED 


Some of the most severe tests of the Standard Model have come from comparing 
theory and experiment in ever more precise determinations of electromagnetic par- 
ticle properties [MoNT 12]. Among these, the topic of lepton magnetic moments 
has achieved a deserved prominence, and we turn to this now by continuing the 
discussion of the previous section. 

The first term in Eq. (1.35) describes the coupling of the photon to the convective 
current of electron, but it is the second term which interests us here. Ignoring the 
convective term and integrating by parts, we obtain to lowest order in q, 


o”ðg A(x) e , 
7 pav —iq-x 
— 1 ew 
eu (p) mm, ( To :) u(pe)e 


oP” Fg, (x) 


=-en (p) P 


Noting that in the nonrelativistic limit of” Fg, /2 —> —o - B, we see that this is the 
coupling of a magnetic field to the electron magnetic moment. The result is usually 
expressed in terms of the gyromagnetic ratio ge}, where M, = —egeiS./2m-~, and to 
order e? we have 


e Av (Pal Jem Œ) Pe) 


e? 
(1+ 5) wipe (1.36) 


z 87 
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e =2 4 
got +0(S) ~o00106.... (1.37) 
JU IT 


2 2 


Clearly, the radiative corrections have modified the Dirac equation value, gpro = 


2. The factor œ /27, which arises from the Pauli term, is but the first of the anoma- 
lous QED contributions. 

For definiteness, let us now focus on theoretical corrections to the muon mag- 
netic moment.” The QED component, whose first nontrivial term is shown in 
Eq. (1.37), encompasses Feynman diagrams with multiple photon exchanges as 
well as charged lepton loops. It is expressible as a series in powers of a/z., 


H 


2 
ai) = = + 0.765857141007) (=) +. (1.38a) 
27 T 


Contributions through (w/zr)° have, in fact, been calculated. 
There is a smaller electroweak (EW) sector with diagrams comprising virtual 
W+, Z° and Higgs-boson exchanges. The leading order term is given by 


G m2 5 Í 2 m2 m2 
ge) = u ALS basta) +0(——)+0(—~]+.---], 
Hu 82m2 13 3 M3, M;, 


where G, and 6,, are respectively the muon decay constant and the Weinberg angle. 
We will discuss each of these later, G, in Sect. V—2 and 0w in Sect. XVI-2. 

Finally, there are important corrections from the strong interactions. It can be 
shown that these are largely influenced by effects of relatively low-energy hadronic 
physics. For example, at the QCD level the lowest order correction amounts to a 
quark—antiquark vacuum polarization, expressible in terms of either e*e~ cross 
sections or vector spectral functions from t decay (see Sect. V—3), 


a)TLO] x 10'! = 6 923(42)(3)  [ø(ete7 — hadrons] 
= 7 015(42)(19)@) [t decay]. 


Upon using the ete~ cross-section data, one finds for the total, 
(SM) __ , (QED) (EW) (Had) 
a, =a, +a ti +a ù 


= [116 584 718.09(0.15) + 154.(1)(2) + 6 923(42)(3)] x 107!! 
= 116591802(2)(42)(26) x 107!!. (1.39a) 


This amounts to a difference between experiment and theory of 
Aa, = a?) — gf) = 287(63)(49) x 107"! (1.39b) 


2 We follow the treatment of Hoecker and Marciano [RPP 12], which lists many references. 
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or 3.6 times the corresponding one-sigma error. If instead tau decay data are used, 
the discrepancy between experiment and theory is 2.40. 

The theoretical values [AOHKN 12] of the magnetic moments a, and a, for 
both electron and muon are displayed in Table V—1. There is at present no consen- 
sus about whether the theoretical predictions are in accord with the experimental 
determinations, and work continues on this subject. At any rate, these represent an 
even more stringent test of QED than the hyperfine frequency in hydrogen because 
theory is far less influenced by hadronic effects, and is thus about a factor of 10* 
more precise. 

Radiative corrections also modify the form of the Dirac coupling. One effect of 
this vertex correction is to contribute to the Lamb shift which lifts the degeneracy 
between the 2S;/2 and 2P;/2 states of the hydrogen atom. Recall that the fine- 
structure corrections, computed as perturbations of the atomic hamiltonian, give a 
total energy contribution 


(AE) fine sr = (AE) Darwin + (AE) spin—orbit + (AE )ret kin en 
7.245 x 10-*eV 1 3 

E i ( j+1/2— 3 i 

which depends only upon the quantum numbers n and j. Thus, the 2S1/2 and 2 P1/2 
atomic levels are degenerate to this order, and in fact to all orders. However, the 
vertex radiative correction breaks the degeneracy, lowering the 2 P;/2 level with 
respect to the 2;/7 level by 1010 MHz. When the anomalous magnetic moment 
coupling (+68 MHz), the Uehling vacuum polarization potential (—27 MHz), and 
effects of higher order in w/z are added to this, the result agrees with the experi- 
mental value (cf. Table V—1). Since the entire Lamb shift arises from field-theoretic 
radiative corrections, one must regard the agreement with experiment as strong 
confirmation for the validity of QED and of the renormalization prescription. 


(1.40) 


n 


The infrared problem 


Viewed collectively, the results of this section point to a remarkable success for 
QED. Yet there remains an apparent blemish — the theory still contains an infinity. 
When the photon ‘mass’ À is set equal to zero, the vertex modification of Eq. (1.29) 
diverges logarithmically due to the presence of terms logarithmic in 47. The resolu- 
tion of this difficulty lies in realizing that any electromagnetic scattering process is 
unavoidably accompanied by a background of events containing one or more soft 
photons whose energy is too small to be detected. For example, consider Coulomb 
scattering of electrons from a heavy point source of charge Ze. The spin-averaged 
cross section for the scattering of unpolarized electrons in the absence of electro- 
magnetic corrections is 
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do Zq? 1— B* sin’ 


dQ 4 ` [pel2B2 sinf 


o 
2 
7 (1.41) 
2 


where 6 = |p,|/E is the electron speed. Radiative corrections modify this result. 
Using the on-shell subtraction prescription and neglecting the anomalous magnetic- 
moment contribution, one has in the limit m? >> q’, 


do do 2a q? m 3 
Ta ie — Aji (=) -> an 1.42 
dQ ox [b+ 3 (n A 5) + | = 
from the QED vertex correction. This diverges if we attempt to take à — 0. 
However, we must also consider the bremsstrahlung process, in which the scat- 


tering amplitude is accompanied by emission of a soft photon of infinitesimal mass 
A and four-momentum k”. For ko sufficiently small, the inelastic bremsstrahlung 
process cannot be experimentally distinguished from the radiatively corrected elas- 
tic scattering of Eq. (1.42). To lowest order in the photon momentum k the invariant 
amplitude M g for bremsstrahlung is? 


Z, 
Ms = SP) [cie -ien)| u (pe) 


i 
Be t+k—me l 
iep] u(Pe) 


e 


+ u(p)) | (few) 


i éE er €E 
= M° xe (2t 2 <), (1.43) 
and has the corresponding cross section 
1 3 2 
donde ae a a oa (1.44) 
(2) 2ko pik peck 


The prime on the integral sign denotes limiting the range of photon energy, à < 
ky < AE, where AE is the detector energy resolution. The polarization sum in 
Eq. (1.44) is performed with the aid of the completeness relation for massive spin- 
one photons 


Klee 
do Dek) = Buy + 5 (1.45) 
pol 
to yield 
do do ' dk 1 2Pe* p! m? m? 
Y= e? 5 — z- —~ ]. (1-46) 
dQ dQ (27)? 2ko \pe-k peck (peck) (p: k)* 


3 For simplicity, we shall take the photon as massless in this amplitude, and at the end indicate the effect of this 
omission. 
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Performing the angular integration in Eq. (1.46) with the aid of 


2 
o 4a 
| dQ — + O02), 


(p: m Ke 
fax eh cafe x f ag Por 5 £4002 ) (1.47) 
Pl: k Pe: = 
_ 4a q g? =! 
2 
~ ( — H [ dx ¢ — nm. -x)) + OA ), 
we find 
do, do® [2a q? 2(AE) q! 
=— li —1 O| — }}. 1.48 
dQ dQ E m2" \ 4 )+ £) aa 
Adding this to the nonradiative cross section of Eq. (1.42), we obtain the finite 
result, 
do do, do® 2a q? Me 5 
= 1 l . 1.49 
da’ da dQ i+ ZS alapi s ee 


Thus, the net effect of soft-photon emission is to replace the photon mass A by the 
detector resolution 2A E, leaving a finite result. 


V—2 The muon 


The analysis just presented for the electron can just as well be repeated for the 
muon. However, the muon has the additional property of being an unstable particle, 
and in the following we shall focus entirely on this aspect. The subject of muon 
decay is important because it provides a direct test of the spin structure of the 
charged weak current. It is also important to be familiar with the calculation of 
photonic corrections to muon decay, as they are part of the process whereby the 
Fermi constant G,, is determined from experiment. 


Muon decay at tree level 


Muon decay does not proceed like the 2p — 1s + y transition in atomic hydrogen 
because the radiative process u —> e+y would conserve neither muon nor electron 
number and is predicted to be highly suppressed in the Standard Model. Indeed 
the current bound [Ad et al. (MEG collab.) 13] for this mode is extremely tiny, 
Brysery < 5.7 x 107! at 90% confidence level.’ 


4 As anticipated, the result quoted in Eq. (1.49) is not quite correct, since although we have given the photon 
an Steen mass à we have not consistently included it, as in Eq. (1.43). In a more careful evaluation the 


constant 3 : is replaced by the value 4 74 L. 
5 See Problem V-1 for a further discussion of u > e + y. 
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In fact, it is the weak transition u (p1, s) —> v,(p2) + e(p3) + Ve(p4) which is 
the dominant decay mode of the muon. In the Standard Model, this process occurs 
through W-boson exchange between the leptons. However, since the momentum 
transfer is small compared to the W-boson mass, it is possible to express muon 
decay in terms of the local Fermi interaction, 


Gy —w 
Lomi = — 5 APM yet ys PO yall + yy (2.1) 
Gy — = 
= -£ Vey" + yha Vo, Yad + yso.) (2.2) 


V2 


where the coupling constant G,, is to be considered a phenomenological quantity 
determined from the muon lifetime. At tree level, G,, is related to basic Standard 
Model parameters as in Eq. (II-3.43). The orderings in Eqs. (2.1)—(2.2) are called, 
respectively, the charge-exchange and charge-retention forms of the interaction, 
and are related by the Fierz transformation of Eq. (C—2.11). 

Let us consider the decay of a polarized muon, with rest-frame spin vector $, 
into final states in which spin is not detected. For simplicity, we set the electron 
mass to zero. The muon decay width is given in terms of a three-body phase space 
integral by 


a a’ pj 
T SeVvyVe = — — _ 2 23 
Hey Ve ay 2E; = fl Nees (Ppi = P2 = P3 — p4) > IMI, (2.3) 


52,53,54 


where in charge-exchange form, 


G 
M= Sa (Po, S2)V° (1 + ys)u (pı, S1) U3, $3) Ya (1 + ys)v(pa, $4). (2.4) 


The muon polarization is described by a four-vector s”, which equals (0, $) in the 
muon rest frame. In computing the squared matrix element, we employ 
_ 1 
Ug(P1, 81 )Ua (P1, $1) = z [n +p) -— Y59) | (2.5) 
to obtain 


XO IMI? = 64G}, (pi: pa Po > P3 — Mapa 8 p2: p3). (2.6) 


52,83,54 


The neutrino phase space integral is easily found to be 


d? p d? p4 1 
5® (0 — p — apf = — (gP 0242008), (2.7 
J SE FA (Q — pr— pa) PŽ P4 AG Q’+20°0*), (2.7) 


where Q = pı — p3. For the electron phase space, it is convenient to define a 
reduced electron energy x = E./W, where W = m,,/2 is the maximum electron 
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energy in the limit of zero electron mass. The standard notation for the electron 
spectum involves the so-called Michel parameters p, ô, & whose values depend on 
the tensorial nature of the beta decay interaction, 


er a E EE aE 
u>evuie = TOZ? ARR 


4 
~2£ cos 6 (1 -x42 E = 1))] x’dxsinodd. (2.8) 
For the V—A chiral structure of the Fermi model, we predict 
p=6=0.75, g= 1.0, (2.9a) 
in good agreement with the current experimental values [RPP 12], 


p = 0.74979 + 0.00026, 5 = 0.75047 + 0.00034, €P„8/p = 1.001819-0016 
(2.9b) 


where P, is the longitudinal muon polarization from pion decay (P,, = P,/E, =1 
in V — A theory). In making comparisons between Eq. (2.9a) and Eq. (2.9b), one 
shoud first subtract from the data corrections due to radiative effects. Upon integra- 
tion over the electron phase space, Eq. (2.8) gives rise to the well-known formula, 
25 
1 Gm, 


Tysevpie = = . 2.10 
caer e Da ens 


This relation has been used to provide an order-of-magnitude estimate for decay 
rates of heavy leptons and quarks. 


Precise determination of Gy 


Thus far, we have worked to lowest order in the local Fermi interaction and have 
assumed massless final state particles. This is not sufficient to describe results from 
modern experiments, e.g., the recent measurement of T,¢),,5, by Webber et al. 
[We etal. (MuLan collab.) 11] is 15 times as precise as any previous determination 
and provides a value of G, with an uncertainty of only 0.6 ppm. 

Including corrections to Eq. (2.10) yields 


2 m’ 
Fee joa fOr (âma), x) rw Op), (2.11a) 


where f (x) is a phase space factor, with x = m?/ m?, 


f(x) = 1 — 8x + 8x? — xt — 12x° lnx ~ 0.999812961, (2.11b) 
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Table V—-2. Determinations of Fermi-model couplings. 


Factor Determination 
Gy Muon decay 
Gg? Tau decay into lepton £ 
Gg G „ plus QED theory 
Nuclear beta deca: 
Gp Vua | y 
TEZ 
Hyperon beta deca 
Gp Vo | yp y 
Ke3 
Gp Vud Fr Pion beta decay (7te2) 
Gp Vas FK Kaon beta decay (K¢2) 


and rw (ya) = 1 + 3y,/5 +--- is a W-boson propagator correction, with y, = 
m?, /Mj,.° The quantity r,(@(m,), x) provides a perturbative expression of the 
photonic radiative corrections, 


&(m,) a? (mp) 


r, (&, x) = Hi (x) + M(x) 


where @(m,,) refers to the MS subtracted quantity 
1 

30 

The functions Hı (x), H2(x) appear, together with references to original work, in 


Chap. 10 of [RPP 12]. In the subsection to follow, we will calculate the leading- 
order contribution to r,. The above theoretical relations lead to the determination 


â(m,) ' =a! + — lnx +--+ ~ 135.901. (2.11d) 


G,, = 1.1663787(6) x 107% GeV’. (2.12) 


The above analysis serves to define the Fermi constant in the context of muon 
decay. Fermi couplings G° for the weak leptonic transitions t~ > e~ + De +0; 
and t — yw + V, + v can likewise be defined and compared with G, (see 
Sect. V—3). However, for weak semileptonic transitions of hadrons (e.g. nuclear 
beta decay) the photonic corrections are not identical to those in muon decay 
because quark charges differ from lepton charges. Such processes define instead 
a quantity called Gg, and we shall present in Sect. VI-1 a calculation of Gg 
for the case of pion decay (cf. Eq. (VII-1.31)). As seen in Table V—2, determi- 
nations involving Gg generally contain quark mixing factors and also meson decay 
6 Itis possible to study muon decay corrections either within just the Fermi effective theory or with the full 

Standard Model, For the former choice rw (yp) is omitted in Eq. (2.1 1a), whereas for the latter it is included. 

Which choice is made affects details of higher-order corrections. We have opted to follow the first edition of 


this book by including rw (yp). In fact, its effect is numerically tiny, affecting only the final decimal place in 
the value of G, given in Eq. (2.12). 
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Fig. V—1 Contributions to muon decay from (a) vertex, (b)-(c) wavefunction 
renormalization, and (d)—(e) bremsstrahlung amplitudes. 


constants. Marciano [Ma 11] has used, among other processes, semileptonic decays 
(nuclear, kaon and B-meson) and CKM unitarity to determine the Fermi constant 
without recourse to muon decay. He finds Go" = 1.166309(350) x 1075 GeV’, 
which is the second most accurate determination after the muon decay value of 
Eq. (2.12), but relatively far less accurate. 


Leading-order photonic correction 


Computation of the lowest order electron and W-boson mass corrections appearing 
in Eq. (2.10) is not difficult, and is left to Prob. V-2. However, the QED radiative 
correction is rather more formidable, and it is to that which we now turn our atten- 
tion. Rather than attempt a detailed presentation, we summarize the analysis of 
[GuPR 80]. We shall work in Feynman gauge, and employ the charge-retention 
ordering for the Fermi interaction. There is an advantage to performing the cal- 
culation as if the muon existed in a spacetime of arbitrary dimension d. Working 
in d = 4 dimensions entails factors which are logarithmic in the electron mass 
and which would forbid the simplifying assumption me = 0. Dimensional regu- 
larization frees one from this restriction, and such potential singularities become 
displayed as poles in the variable € = (4 — d)/2. Although there would appear to 
be difficulty in extending the Dirac matrix ys to arbitrary spacetime dimensions, 
this turns out not to be a problem here. The set of radiative corrections consists of 
three parts, which are displayed in Fig. V—1, (i) vertex (Fig. V—1(a)), (ii) self-energy 
(Fig. V—1(b)-(c)), and (iii) bremsstrahlung (Fig. V—1(d)—(e)). We shall begin with 
the bremsstrahlung part of the calculation and then proceed to the vertex and self- 
energy contributions. 

The amplitude for the bremsstrahlung (B) process u(pı) —> v,(p2) + e(p3) + 


Ve(p4) + y (Ps) is given by 


G 
Ms = Seay" + ys)v(pa) 


1 1 
u Q 1 Q 1 $ 
x mip] rel E ee  ggeom + rouen 
(2.13) 
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where e€ is the photon polarization vector. The spin-averaged bremsstrahlung tran- 
sition rate for d spacetime dimensions in the muon rest frame is then given by’ 


5 


1 d‘'p,; ; 1 
s= an, | (secant) 278 (0 -m pi) 5 MP. 


jJ=2 spins 


(2.14) 


where Q’ = Q — ps = pı — p3 — ps. A lengthy analysis yields a result which can 
be expanded in powers of € = (4 — d)/2 to read 


2 
e G m; 3a [ m, r2 -—e) 
192m3 4 \ 3273/2 r (f — e) TG -—©r (5 — 3€) 


6 5-6 In? 215 
i am E TE T E |i (2.15) 
e? € 2 6 


Observe that singularities are encountered as € —> 0. 
The radiative correction (R) contribution to the muon transition rate is given by 


4 
1 d?p; 1 
Tp = —— | l FS) 79725 (0 p = 3 2 
i 2M, J j=2 (om) bee (Q p2 pa) 2 [Mint 


spins 


(2.16) 


where IMIZ: is the interference term between the Fermi-model amplitudes, which 


are, respectively, zeroth order (M®) and first order (M®) in e?, 
IMIZ = MO*M® + MP*M® (2.17) 
in * 7 


The first-order amplitude can be written as a product of neutrino factors and a term 
(M pr) containing radiative corrections of the charged leptons, 


2 
G 
MY = TETP») + ys)v(pa)M%. (2.18) 


The quantity MQ is itself expressible as the sum of vertex (V) and self-energy 
(SE) contributions, 


M3, = u(ps) (M% + M%p) u(pi). (2.19) 
The vertex modification of Fig. V—1(c) 


jee! d*k y" (p3—K) y A+ ys) (pi — K+ my) Yu 
vit Oxy K? (p — k)? [(pi - k}? — m2] 


; (2.20) 


7 Since the result that we seek is finite and scale independent, in this section we shall suppress the scale 
parameter u introduced in Eq. (II-1.21b). 
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has the same form as the electromagnetic vertex correction for the electron dis- 
cussed previously (cf. Eq. (1.19)) except that it contains the weak vertex y~(1+ys). 
Upon employing the Feynman parameterization in Eq. (2.20) and using the muon 
equation of motion, the extension of the vertex amplitude to d dimensions can ulti- 
mately be expressed in terms of hypergeometric functions, 


7 A p%B + pic 
Mý = ar (3-5) 7 Gn E atrata- EC], 


(2.21) 


where 


F (3—2,1; 2;£) (d-3)F(2— 4,1: $; £) 


29° KII 


(d-3\d-2) @-4@-2) 
Aaa aLa (2.22) 
(d — 4} (d — 3) 
_d4y.d 
Fa F (3 pls sti; E) 
c- FB d2; 4+1; £) 2ra) 
E d(d — 2) (d — 2)(d — 3) 
For the muon self-energy amplitude of Fig. (V-1(d)), we write 
d My 


(2r)! q? (q? — 2p -q + p- m2)’ 


remembering that a factor of e? has already been extracted in Eq. (2.18). Imple- 
menting the Feynman parameterization and integrating over the virtual momentum 
yields an expression, 


r(2—$) 1 


1 a — x)“ 9/2 
E(p)= Ge ee dx [mid +x(2—d)p| 


4—d)/2’ 
(1 -sh )/ 
m2 


which with the aid of Eq. (C—5.5) in App. C can be written in terms of hyper- 
geometric functions, 


r2—$) 1 2d 4-d d p’ 
DOS aay? mind ("21 (T 2 m 


4 f4- 2 p 
-p -F Lat r (2.25) 
d 2 2 m? 


When the self-energy is expanded in powers of » — m,,, the leading term is just the 
mass shift, which is removed by mass renormalization. We require the p-derivative 


(2.24) 
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of X (p) evaluated at y = m„. Being careful while carrying out the differentiation 
to interpret p? factors as Pp, we find 
ax ae (2-2) 1 1-d 
Op pony (42) 4/2 nd = F 


(2.26) 


It is this quantity multiplied by the vertex y*(1 + ys) which ultimately gives rise 
to M$pg. However, in addition to mass renormalization there is also wavefunction 
renormalization, whose effect is to reduce the above quantity by a factor of 2, 
yielding 


e T=] 1 l-d 
SE (4m)1/2 m44 4(4 — d)(d — 3) 


y” + ys). (2.27) 


In principle, there also exists the electron self-energy contribution. As can be veri- 
fied by direct calculation, this vanishes because the electron is taken as massless. 
Thus, we conclude that 


d mi4 
Msg =4T (3 = 3 ary Ay” (1 + ys), 
NEE- L (2.28) 
4 (4 —d)(d —3) 


The net effect of the self-energy contribution is to replace A, in the vertex ampli- 
tude of Eq. (2.21) by A = A, + Ap. 

Insertion of the radiatively corrected amplitudes into Eq. (2.16) leads to a tran- 
sition rate l’ gR, which expanded to lowest order in € = (4 — d)/2, has the form 


—2e 
_ Gum, 3a f m, re=6 
r 


E 192m3 4 (3273/2 3 _ e)P E —3e) 
6 5-6 Sm? 5 
x (-3 += tsy -3-5 +24+0©). 229) 
e? € 2 3 


Like the bremsstrahlung contribution, the radiatively corrected decay rate is found 
to be singular in the € — O limit. However, the final result which is obtained by 
adding the radiative correction of Eq. (2.29) to that of the bremsstrahlung expres- 
sion of Eq. (2.15) is found to be free of divergences, 


G?m>. a(m,) 25 
SP QED) = T F Z H H H 2 L p 
muon = 1 RT 'B= -To oq OOF) ve) 


which is the leading-order contribution to the function r, of Eq. (2.11c). 
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V-3 The t lepton 


The heaviest known lepton is t(1777), having been discovered in e*e™ collisions 
in 1975. There exists also an associated neutrino v, with current mass limit m,, < 
18.2 MeV. Like the muon, the t can decay via purely leptonic modes, 


ie +, +0, 
tT >] e tbe thr , (3.1) 


However, a new element exists in t decay, for numerous semileptonic modes are 
also present, 


uw +v 
co | mo tate, (3.2) 


Experiment has revealed the semileptonic sector to be an important component 
of tau decay, e.g., [Am et al. (Heavy Flavor Averaging Group collab.) 12], 
P semileptonic _ 1— Brien, = Brrs pi, 


R, = ———_ = = 3.6280 + 0.0094, (3.3) 


Dresen expt Bir senv; 


where Br denotes branching ratio. It is possible to obtain a simple but naive esti- 
mate of R, as follows. Because the t is lighter than any charmed hadron, semi- 
leptonic decay amplitudes must involve the quark charged weak current 


JE = Vady” (1 + ys)u + Vas Sy" (1 + y5)u, (3.4) 


where Vaa and Vas are CKM mixing elements. Neglect of all final state masses and 
of effects associated with quark hadronization (an assumption only approximately 
valid at this relatively low energy) implies the estimates 


REN? ~ N [IVoa]? + IVasl?] = Ne — 3.0, 


(naive) Ba (naive) PEN 
Bri serv a Bra —_ 2 efi N. N = 0.2, (3.5) 
c = 


where N. is the number of quark color degrees of freedom. The above analysis 
although rough, nonetheless yields estimates for R,, T > evev, and T > LV, Vr 
in approximate accord with the corresponding experimental values. Also, it is not 
inconsistent with our belief that N. = 3. However, we can and will improve upon 
this state of affairs (cf. Eq. (3.27b)) and as a bonus will obtain a determination of 
the strong coupling a; (m+). 
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Exclusive leptonic decays 


The momentum spectra of the electron and muon modes also probe the nature of t 
decay. The Michel parameter p of Eq. (2.8) should equal 0.75 for the usual V — A 
currents, zero for the combination V + A and 0.375 for V or A separately. The 
observed value p = 0.745 + 0.008 is in accord with the V — A structure. 

The t leptonic decays afford an opportunity to test the principle of lepton univer- 
sality, i.e., the premise that the only physical difference among the charged leptons 
is that of mass. In particular, all the charged leptons are expected to have identical 
charged current weak couplings, cf. Eqs. (II-3.36),(I-3.37), 


Qe = $2 = Lur = 22 (universality condition). (3.6) 


This has been tested in [Am et al. (Heavy Flavor Averaging Group collab.) 12], 
using the following Standard Model description for the leptonic decay mode of a 
heavy lepton L, 


G.Gem> 
Proves = Saga S Ory (mi drwO), 
2 
Er a(mz) (25 2 
Gi = —— =j Z r?), 3.7 
“— 4VJ2M3 AETA = 7 e 


where f(x) is as in Eq. (2.11b) and rw (yz) = 1 + 3m4 /(5M% +- -. They find 
[Am et al. (Heavy Flavor Averaging Group collab.) 12], 


E L 1.0006 + 0.0021, ÊZ = 1.0024 + 0.0021, 22 


82u 82¢ 82¢ 


= 1.0018 + 0.0014, 
(3.8) 


consistent with the universality condition of Eq. (3.6). 

There are other ways to study the universality principle. Looking forward to 
Chap. XVI, we shall exhibit in Eq. (XVI-2.6) the result of testing lepton uni- 
versality with the decays Z? —> ££ (£ = e, m, t). Unlike the above example in 
Eq. (3.8), the Z° decay widths are functions of neutral weak coupling constants. 
Yet another approach, which uses charged current couplings, is to compare leptonic 
and semileptonic decays, like H — jv, (where H can be a pion, kaon, etc.) with 
tT > Hv. 


Exclusive semileptonic decays 


Matters are somewhat more complex for the hadronic final states, due in part to the 
large number of modes. Still, for many of these we can make detailed confrontation 
of theoretical predictions with experimental results. We begin by noting that the 
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semileptonic decay amplitude factorizes into purely leptonic and hadronic matrix 
elements of the weak current, 

Gu 
V2 
L! = (v: (P) JieplT(P)) = Vr (p')PT(p), 

i= (hadron (a 0) = (hadron|V%, ľu + Vz 5TŁu |0), (3.9) 


M semilept = i H,, 


where r% 
single meson, 


Yu(1 + ys). In the following, we analyze some modes containing a 


T — meson + v (meson =a ,K_, p (770), K* (892)) . (3.10) 


Weak-current matrix elements which connect the vacuum with spin-parity J? = 
07, 1* hadrons are sensitive to only the axial-vector current, whereas J P — 0t,17 
states arise from the vector current. In each case, the vacuum-to-meson matrix 
element has a form dictated up to a constant by Lorentz invariance, 


(x7 (q) |[dvuysu] (0)| 0) = -iV2Frqu, (3.11a) 
(K~(q) [Svu ysu] 0)| 0) = -iV2Frqu, (3.11b) 
(o~ (q; A) |[dyuu] 0)|0) = V2gpe%(q, A), (3.11c) 
(K* (q, A) | [5] ()| 0) = V2ex+€% (q, A), (3.114) 


where the quantities g, and gg» are the vector meson decay constants. These quan- 
tities contribute to the transition rates for pseudoscalar (p) and vector (v) emission, 
and we find from straightforward calculations 


G2 g 2 m2 2 m2 
Tp, = KM —— = mm? 1 — — 1+ 2— x (3.12) 
8m \m? m2 m2 


where mp, my are the meson masses, n = |Vual? for AS = 0 decay and n = |Vas|? 
for AS = 1 decay. 

It is possible to use the above formulae to extract constants such as Fy, ..., 2xK* 
from tau decay data. However, such quantities are obtained more precisely from 
other processes and, in practice, one employs them in tau decay to make branching- 
ratio predictions. Although QCD-lattice studies have steadily improved on their 
predictions of such constants, we shall focus instead on phenomenological deter- 
minations. In Chap. VII, we shall show how the values F, = 92.21 MeV and 
Fx /F = 1.197 are found from a careful analysis of pion and kaon leptonic weak 
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Table V-3. Some hadronic modes in tau decay. 


Mode Hadronic input Br [thy]* Br [expt] 


T 32H +v cı Fr 11.4 10.83 + 0.06 
t > K+), c35, FK 0.8 0.70 + 0.01 
T >p +v C18p 23.4+ 0.8 25.52 + 0.9 

t > K*4+y, C3S1 2K* 1.1+0.1 1.33 + 0.13 
tT > non nn’ a(ete— — hadr) 4.9 4.76 + 0.06 


tT —> nnl nln? o(e™e™ — hadr) 0.98 1.05 + 0.07 


Branching ratios are given in percent. 


decay. Interestingly, the hadronic matrix elements which contribute there are just 
the conjugates of those appearing in Eqs. (3.11a), (3.11b). By contrast, the quantity 
8, is obtained not from weak decay data, but rather from an electromagnetic decay 
such as p? — eë. That the same quantity g, should occur in both weak and electro- 
magnetic transitions is a consequence of the isospin structure of quark currents. 
That is, the electromagnetic current operator is expressed in terms of octet vector 
current operators by J" = vi + We v. Since the latter component is an isotopic 
scalar whereas the p meson carries isospin one, it follows from the Wigner—Eckart 
theorem that 
1 

af 


The transition rate for the electromagnetic decay p? — eé is given, with final state 
masses neglected, by 


(O|J2"10°(p)) = (CO1V,210°(p)) = —= (01V 10° (P)) = seu). 3.13) 


2 
4ra? f gp 
Dose = = 3 3.14 
d 3 (£ me eee 


from which we find p/m, = 0.198 + 0.009. The K*~v, mode can be estimated 
by using the flavor-SU(3) relation gx =g,. The predictions for single-hadron 
branching ratios are collected in Table V—3, and are seen to be in satisfactory agree- 
ment with the observed values. 

A somewhat different approach can be used to obtain predictions for strangeness- 
conserving modes with J’ = 17. Since matrix elements of the vector-charged 
current can be obtained through an isospin rotation from the isovector part of the 
ee annihilation cross section into hadrons, we can write for a given neutral J = 1 
hadronic final state f°, 


22 
aona a h (3.15) 
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where we have defined 


X 2738% = pp) F110) F140)" = 119.47) (—4? gu + qug) . 
f? 
(3.16) 


The t` transition into the isotopically related charged state f7, 


1 l 
If) = aE (3.17) 
is governed by the same function TI (q?) of hadronic final states as occurs in 
Eq. (3.15). Including the lepton current and relevant constants, and performing 
the integration over v, phase space yields a decay rate 


G? | Vaal Sew f"? 2 E 
Trop, = em | dq? (mz — 4°) (mz +2q°) 0; (a°), (3.18) 


where Sgw is given in Eq. (3.25c). The content of Eq. (3.18) is often expressed as 
a ratio, 
ly fy, — 3| Vaal? Sew 


as 2 A22 (=!) 
Lisenn 27am? i akhama] (a 28) Eoaea 05)» G19) 


Thus, we find, e.g., for 4x final states, the results listed in Table V—3. 

There exist numerous additional hadronic decay modes of the t lepton. Exam- 
ples include final hadronic states containing K K, K Km, etc., and it is possible 
to analyze each of these with various degrees of theoretical confidence. Another 
interesting use of the t semileptonic decay has been to confirm by inference the 
fundamental structure of the weak quark current from the absence of the mode 
tT —> m` nw. This mode, proceeding through the vector current, would violate 
G-parity invariance. Here G-parity refers to the product of charge conjugation and 
a rotation by z radians about the 2-axis in isospin space, 


G = Ce., (3.20) 


A weak current which could induce a AG Æ O transition is referred to as a 
second-class current. Such currents do not occur naturally within the quark model. 
The x~ nv, mode has not been detected, with an existing sensitivity [RPP 12] of 
Brrorqv, < 9.9 X 1075. This result, consistent with the absence of second-class 
currents, fits securely within the framework of the Standard Model. 


Inclusive semileptonic decays 


The inclusive semileptonic decay of the tau is denoted as tT —> v, + X, where X 
represents the sum over all kinematically allowed hadronic states. Let us restrict 
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our attention to the Cabibbo-allowed component, i.e., decay into an even or odd 
number of pions. The decay rate at invariant squared-energy s is 


dT [t >v | 


odd 
ds 
GV 
Ludy 2 2 2 e al 0 ) 
= mo =s m + 2s +m ; 3.21 
cana ) [ sae ps. a 


as expressed in terms of the so-called vector and axial-vector spectral functions, 
spin-one py (s), pa(s) and spin-zero pP (s). 

We can gain some physical understanding of the spectral functions by first study- 
ing the propagator i A(x) for a free, scalar field g(x) (cf. Eq. (C—2.12)). Its Fourier 
transform, 


1 
N(q*) = ——— with  ZmI(q*) = 76 (q*— 1’), (3.22) 
M — q= te 
reveals that the free field (x) excites the vacuum to just the single state with q? = 
i”. For the V — A currents which induce the inclusive tau decay, the momentum 


space propagators are written 
if ax eiT (O|T (V5 (x) V3 (0) — AS (x) A3(0)) 10) 


v v 1 1 v 0 
= (q"q" — q8) (IY, - 1) 0D - nRa, 623) 
where m, Ty and ny. are respectively the spin-one and spin-zero correlators. 
The spectral functions are proportional to the imaginary parts of the corresponding 
correlators, 


Impas amaa Im, 6) =p, 36), (3.24) 


They encode how the isospin vector and axial-vector currents excite various n-pion 
states at invariant energy s < m2. Figure V-2 displays the V — A spectral function® 
( Pv3— P 4.3) (s) as measured in tau decay from the ALEPH collaboration. The first 
peak is from the p meson, followed by the negative a; peak and then the four-pion 
component, etc. 


Some applications of t decays 


Studies of t decays have also proved valuable in providing a measure of a,(m,). 
Such a determination is significant because the tau lepton mass is one of the lowest 
energy scales (the charm quark mass is another) at which this is possible. The 
procedure essentially amounts to performing a more careful evaluation of R, than 


8 Here and henceforth we abbreviate Pv,3 > py and p43 > pa. 
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Fig. V-2 Authors’ representation of ALEPH data for the V — A spectral functions 
from tau decay. 


the naive reasoning leading to the result in Eq. (3.5). Recall that we have previously 
displayed in Eq. (3.3) the measured value of R,. If we restrict ourselves to the 
Cabibbo-allowed decays (notationally, R, > R'‘) by subtracting off the Cabibbo 
suppressed transitions, then experiment gives [Pi 13] 


R? = 3.4771 + 0.0084. (3.25a) 
On the theory side, a careful analysis of tau decays yields 
RZ = Nc | Vaal’ Sew [1 + dnp + 5p], (3.25b) 


where Spw is an electroweak correction, 


) +- -+ = 1.0201 + 0.0003 (3.25c) 


and ônp ~ —0.0059 + 0.0014 represents the nonperturbative QCD corrections. 
These two are insignificant compared to ôp, the perturbative QCD correction, whose 
numerical value is inferred by comparing the above experimental and theoretical 
relations, 


dp = 0.2030 + 0.0033, (3.25d) 


amounting to a 20% effect. To derive a theoretical expression for dp, one first 
expresses R"! as the contour integral [BrNP 92] 


ds ge 
R" = 6rxi 2 l= 
7 f m? ( = 


x (0 + 25) my- 2 1%s)| , (3.26) 
m m 


T T 
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where the presence of I1‘'*) in the above is associated with requiring singularity- 
free behavior in the complex-gq? plane upon passing to the chiral limit. We shall not 
detail the next steps, which involve use of the operator-product expansion (OPE) 
for 1+ (s) on the circle |s| = m2. However, the physical picture which emerges 
is akin to that of the quark—antiquark loop in electromagnetic vacuum polarization 
(cf. Fig. II-2(a)), except now the currents are the weak currents, and the photon- 
exchange perturbations of the EM case become instead gluon exchanges. The result 
of this is expressed as the series 


õp =D) Kn A (as) (3.27a) 
n=l 
where Kı = 1, Kə = 1.63982, etc., and 
1 ds (a,(—s)\" j 4 
A” (as) =Z wes (=) (1 ea) 2 4.9 ow ) .  (3.27b) 
2m Jism $ T m2 m? ms 


The a,(—s) dependence is expressible in terms of a,(m,), which finally can be 
determined in terms of the experimentally measured R“‘. Depending on details of 
the analysis, some scatter occurs in the value found for a, (m2). The averaging in 
[Pi 13] (see also [Bo et al. 12]) arrives at 


æs (m2) = 0.334 + 0.014, (3.28) 


having about 4% uncertainty. Renormalization group running of this result up to 
the Z-boson mass gives a,(Mz) = 0.1204 + 0.0016, in accord with the 2011 world 
average value a,(Mz) = 0.1183 + 0.0010. 

A rather different bit of t-related physics involves a set of sum rules which 
contain the py,,4 spectral functions [We 67a; Das et al. 67], 


> | Pv(s) = pas) ; 1 m? 
f. ds A = AL ola) + zezin (= , (3.29a) 


where Li (i) is a chiral coefficient to be defined in Sect. VII-2 and quantified in 
Table VII-1, and u is an arbitrary energy scale which cancels between the two 
terms on the right-hand side, 


f ds (evs) = palsy) = FE, (3.29b) 


T 


/ ds s (py(s) — pa(s)) = 0, (3.29c) 
0 


we 16 
| ds s én (—5) (ov(s) ~ pals) = da 


302 (m+ — m9), (3.29d) 
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where A is an energy scale which is arbitrary by virtue of Eq. (3.29c). Although 
the sum rules of Eqs. (3.29a,b) hold in the physical world, those of Eqs. (3.29c,d) 
are derived in the chiral limit of massless u, d quarks, so the quantities F, and 
m+ are understood to have slightly different numerical values from their physical 
counterparts. Like the extraction of a;(m,) from t decay, these sum rules have 
been the subject of much study over time. Their convergence is sensitive to the 
spectral functions in the large s limit. This is known in the m, = mg limit to be 
(ov,3 — pa.3) (s) ~ s~3, which suffices to provide convergence. 

Another, perhaps surprising, application of the spectral functions py,3 and p43 
involves CP violation in the kaon system. This is presented in Sect. IX—3, where 
the association of these spectral functions with e’/e (a measure of direct to indirect 
CP violation) is described. 


Problems 


(1) Effective lagrangian for p —> e+ y 
In describing the decay u —> e + y, one may try to use an effective lagrangian 
L3 4 which contains terms of dimensions 3 and 4, 


L34 = 43 (eu + he) + ia, (eu + upe), 


where D, = 0, + ie Qa A, and a3, a4 are constants. 

(a) Show by direct calculation that £34 does not lead to y > e + y. 

(b) If £3.4 is added to the QED lagrangian for muons and electrons, show that 
one can define new fields wz’ and e’ to yield a lagrangian which is diago- 
nal in flavor. Thus, even in the presence of £34, there are two conserved 
fermion numbers. 

(c) At dimension 5, u — e+ y can be described by a gauge-invariant effective 
lagrangian containing constants c, d, 


Ls = zo (c + dys)u Fag + h.c. 


Obtain bounds on c, d from the present limit for y —> e + y. 
(2) Muon decay 
(a) Obtain the leading O(m? / m?) correction to the Fermi-model expression 
Eq. (2.10) for the muon decay width. 
(b) Do the same for the leading O(m? / MẸ) correction. 
(3) Vacuum polarization and dispersion relations 
The vacuum polarization TI (q°) associated with a loop containing a spin one- 
half fermion-antifermion pair, each of mass m, can be written as the sum of a 
term containing an ultraviolet cut-off A and a finite contribution I; q”), 
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; all N? 1 q? 
II (q^) = “jim -2f axx(1=yin(1- Sx -») | 


a A » 

= —lÌn— +N 2y, 
3T n m2 ony (4 ) 

(a) Show that II p (q°) is an analytic function of q? with branch point at q? = 


4m? and with Im Ñ; (q?) = aR p (q?)/3, where 


q? == 4m2 2m? + q? 
R°) = | y F 


is related to the rate for radiative pair creation via 


a 
Daraq = p FIIO FIGI) = gg” + qip EO 
f 
(b) Use Cauchy’s theorem and the result of (a) to express 
2 oo 
K aq Rr(s) 
Ty (a°) = —— ds ——————_.. 
(7°) 3m Jam2 8 (s — gq? —i€) 
(c) The form of I; (q?) given in part (a) can be re-expressed in a dispersion 
representation. First change variables in (a) to y = 1 — 2x and integrate by 
parts to obtain 


2 2 
x Qa q°(il-—y-)| d l , 
Üg = fa l _ 
ae a afi- al a 


3 2 

y q 
== dy 2 — i 
ef y 1 oo 


Then, change variables again to s = 4m?/(1 — y?) and demonstrate that 
the dispersion result of (b) obtains. 


VI 


Neutrinos 


When the Standard Model first emerged, there was no evidence of neutrino mass. 
Since only left-chiral neutrino fields are coupled to the gauge bosons, the simplest 
way to accommodate the lack of a neutrino mass was to omit any right-handed 
counterparts to the neutrino field, in which case masslessness is automatic. Because 
of the degeneracy of the three massless neutrinos, the charged weak leptonic cur- 
rent can be made diagonal and there exists no lepton analog to the CKM matrix. 
In light of evidence for neutrino mass, the most conservative response is to pos- 
tulate the existence of right-handed neutrinos, similar to the right-handed partners 
of the other fields. Because the right-handed neutrino carries no gauge charge, its 
mass may be Dirac or Majorana (or both), and it may be heavy or light. Whether 
one considers this modification to be an extension beyond the Standard Model 
or not is largely a matter of semantics. In this chapter, we will describe the rich 
physics induced by the inclusion of a right-handed neutrino. We note in passing 
that all fermion fields appearing here will be described as four-component spinors. 


VI-1 Neutrino mass 


A right-handed neutrino vr has no couplings to any of the gauge fields because its 
Standard Model charges are zero.' Nonetheless, it can enter the lagrangian in two 
ways: there can be a Yukawa coupling to lepton doublet £z plus a Higgs and there 
can be a Majorana mass term involving vg. Considering only one generation for 
the moment, these possibilities are? 


ENS my —— 
Lip = —8vl_ Pvp — i WR) ve + h.c. (1.1) 


ly R is electrically neutral (Q = 0) and like all RH particles in the Standard Model is a weak isosinglet 


(Tw3 = 0), so by Eq. (II-3.8) it has zero weak hypercharge, Yw = 0. 

2 Tn neutrino physics, a sterile neutrino is defined as one which has no interactions whatsoever with Standard 
Model particles. The right-handed neutrino discussed here is not sterile if gy 4 0 because it can then couple 
to the Higgs field as in Eq. (1.1). 
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Recall that the left-handed neutrino field is part of an SU(2); doublet and so 
can have no Majorana mass term because the combination (v,)° vz is not gauge- 
invariant. The right-handed Majorana mass could be set equal to zero by the 
imposition of a discrete symmetry (e.g. lepton number) but this is an additional 
assumption beyond the Standard Model gauge symmetries. 

When the Higgs field picks up a vacuum expectation value, this leads to a mass 
matrix of the form 


ET (z cr) es a (+) + he., (1.2) 


where the Dirac mass is mp = g,v/ ./2 and where we have used the fact that 
(Wi) WS = (Wj) Wi. The above matrix can be diagonalized by defining fields 


Va = cos 0 vr + sind v? , Vp = cos 0 vz — sind vp, (1.3) 


with tan 20 = 2mp/my. The mass terms then become 


Ma [-——— Eee: Mp [—— c 
-Loru = [Wave + Gadus] + [Gov + Gods], aA 
with 
Ma = my cos’ 6 +mpsin26 , mp = my sin? 0 — mp sin 28. (1.5) 


The mass matrix of Eq. (1.2) has one negative eigenvalue, and given the mixing 
angle this can be seen to be m,. As discussed earlier in Sect. I-3, this is kine- 
matically equivalent to a positive mass, and the eigenvalue can be made positive 
by the phase change v, —> ivp. However, we shall leave the phase unchanged as 
it would induce an unusual phase in the weak mixing matrix. Finally, inverting 
Eq. (1.3) yields the following relation between the neutrino field vz and the mass 
eigenstates vf and vp, 


vz = cos vp +sin@ vý. (1.6) 


It is this combination of the mass eigenstates which constitutes the neutrino com- 
ponent of charged and neutral weak currents first encountered in Sect. H-3. 

There are two obvious limiting cases for the mass matrix of Eq. (1.2). In one 
limit, the Majorana mass term vanishes, my = 0, with the result that the neutrino 
is a Dirac fermion with mass mp. Here, both particle and antiparticle can have 
positive helicity (right-handed) or negative helicity (left-handed), so there are four 
degrees of freedom. As noted in Sect. I-3, despite appearances, Eq. (1.4) reduces to 
the standard Dirac lagrangian in this limit (with 0 = 2/4 and ma = —mp = mp). 
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The other case is that of a very large Majorana mass my. Here, one eigenvalue 
becomes large and the other small, 
mp 
Ma = MyM, mp, = ——. (1.7) 
Mm 
The mixing angle in this case becomes tiny, 0 = mp/my < 1, so that eigenfunc- 
tions are just Va = VR, Vp = vz up to corrections of order mp/mm. Both of these 
eigenstates are Majorana fields.* There are still four degrees of freedom present, 
viz. left-handed and right-handed helicity states for each of the two self-conjugate 
neutrinos. This is the famous seesaw mechanism [GeRS 79], which has the poten- 
tial to explain the fact that the neutrinos are much lighter than the quarks and other 
leptons. As an example, given the mass constraints cited in Chap. I, at least one 
neutrino must have rest-energy in excess of 0.05 eV, and if we use m, for the cor- 
responding Dirac mass, this would be compatible with my ~ 6 x 10!° GeV. We see 
that the light field in this case is a Majorana field of the left-handed neutrino. Even 
though the direct left-handed Majorana mass term was forbidden by gauge symme- 
try, after symmetry breaking the left-handed field assumes a Majorana nature. We 
can understand this feature more directly using effective lagrangian techniques, to 
which we now turn. 


Equivalence of heavy Majorana mass to a dimension-five operator 


We have explained in Chap. IV how a heavy field may be integrated out from a 
theory. Here, we consider a process that involves two applications of the first term 
in Eq. (1.1), gvl_ Pvp, in which the right-handed neutrino is a (self-conjugate!) 
Majorana fermion. If the Majorana mass is large, ve becomes heavy and can 
accordingly be removed. We then find the following residual interaction [We 79a] 
involving just the light fields, 


| eee 
Ls =-—-€, O60 + he., (1.8) 
Am 


where 1/Ay = g?/(2my). This interaction is invariant under SU(2); gauge 
interactions because the lepton doublet and the Higgs doublet both transform in 
the same way. The fields in £5 carry a total mass dimension of five. Hence, this 
operator must have a coupling constant with the dimensions of an inverse mass, 
and so the operator cannot be part of a renormalizable lagrangian. However, in 
effective field theory, this operator is an allowed addition to the lagrangian of the 
light Standard Model fields, and it is suppressed by a single power of the heavy 
scale my. 


3 In the general case, where the mass parameters ma, mp are allowed to have arbitrary values, both neutrinos 
are Majorana. In fact, a Dirac neutrino can itself be interpreted as a pair of degenerate Majorana neutrinos. 
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Once the Higgs field picks up its vacuum expectation value, this lagrangian turns 
into a Majorana lagrangian for the left-handed neutrino, 


2 

Ls > -n + h.c., (1.9) 
reproducing the mass eigenvalue and eigenfunction calculated above via diago- 
nalization. So we see that a left-handed Majorana mass term is allowed after the 
electroweak symmetry breaking if we include operators of dimension five. Indeed, 
although we have just found the above operator by integrating out a particular 
heavy field, its existence can be more general than this particular calculation. There 
could be other theories beyond the Standard Model which might generate this 
operator. 

The properties of neutrino mass are suggestive of physics beyond the Standard 
Model, although they are not conclusive proof of that. We have seen that there is 
no conflict between the idea of neutrino mass and the symmetries of the Standard 
Model. Once one allows the possibility of right-handed neutrino fields, both Dirac 
and Majorana mass terms will occur unless one makes an additional symmetry 
assumption of lepton-number conservation, which would set the Majorana mass 
equal to zero. Even if this extra discrete symmetry were imposed, Dirac masses 
could still account for observations. However, the small magnitude of the observed 
neutrino masses is puzzling in one way or another. If the Majorana masses are 
small or zero such that Dirac masses are dominant, one would require the Yukawa 
couplings to be remarkably small — roughly a billion times smaller than the 
Yukawas for the charged leptons. On the other hand, if the Majorana mass is large, 
the neutrino masses are naturally small via the seesaw mechanism, but then one has 
to understand the large value of the Majorana scale. A Majorana mass in the range 
10° — 10!4 GeV would not match any of the scales of the Standard Model (nor 
does it match estimates of Grand Unification scales). While the present structure 
is consistent with the interactions of the Standard Model, we hope that future New 
Physics will explain the puzzles of the quark and lepton mass scales, which are 
most dramatic in the case of neutrino masses. 


VI-—2 Lepton mixing 


In the previous section we considered mass diagonalization for a single species 
of neutrino. In the Standard Model, there are, however, three generations of lep- 
tons. This means that both the Dirac and Majorana mass terms will involve 3 x 3 
matrices, mp and my. The Dirac mass matrix is, in general, complex but not Her- 
mitian, while the Majorana mass matrix will be complex symmetric. The overall 
mass matrix must be diagonalized, and there will be a resultant weak mixing matrix 
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for the charged weak current. We shall consider lepton mixing in the two limiting 
cases discussed above, first for a pure Dirac neutrino mass and then in the seesaw 
limit. 

Dirac mass: The biunitary diagonalization of the lepton mass matrices has 
already been carried out in Eqs. (II-4.1)—(I-4.7b) for the case of a pure Dirac 
mass. The results for leptons proceed analogously to those for quarks. Mixing 
between generations occurs in the leptonic charged weak current (recall that the 
lepton mass eigenstates are Ùz, = {v1, v2, v3}, and €z = {e, u, T}L), 


Ji dept) = 20, yë = 20S SS yp", = 20, VO y", (2.1) 
where 
Vv = ss: (2.2) 


is the Dirac lepton mixing matrix. As an example, the electron’s contribution to the 
charged weak current is given by 


JE) = 20L 1V + 01 2V + L3 VE ]y“eL = 20L ey"er, (23) 


which shows the neutrino vz e created in this process to be a linear combination 
of the three neutrino mass eigenstates. The lepton mixing matrix V of Eq. (2.2) 
will have the same structure as the quark mixing matrix of Eq. (II-4.17) with three 
mixing angles {6;;} and one CP-violating phase ô. 

Majorana mass: If the right-handed Majorana mass is very large, or if we invoke 
the dimension-five operator of the previous section, we see that the light eigenstate 
is a left-handed Majorana particle with mass 


j 
m; = —Mp—mp. (2.4) 
my 


Here, the factors are themselves 3 x 3 matrices and we have been careful with the 
ordering of the elements. 

The matrix mz is nondiagonal, as are the individual elements mp and my. The 
Dirac part is diagonalized as 


m§”® = §"'mpS%,. (2.5) 
Inserting the diagonalized Dirac part into the full mass matrix yields 
mz = S} C SY, (2.6) 


where the central matrix C is defined as 


—  diagevt | Qua (diag) 
C =mp ong he f (2.7) 
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The symmetric (but generally complex-valued) central matrix C can be diagonal- 
ized with a unitary matrix F, 


my 0 0 
C=Fm,F'’=FI0 m 0O0]F'. (2.8) 
0 0 m3 


The masses in the diagonal matrix m, are the physical neutrino masses. 

The PMNS matrix involves the rotations that diagonalize the mass matrices of 
the charged leptons and the neutrinos [Po 68, MaNS 62]. This also includes the 
rotation that diagonalizes the central matrix. Therefore, in terms of the quantities 
defined above, the PMNS matrix becomes 


U = Fists. (2.9) 


Like the n x n Dirac mixing matrix for quarks and leptons, the Majorana mixing 
matrix has n? real-valued parameters, of which n(n—1)/2 are angles and n(n+1)/2 
are phases. However, whereas field redefinitions remove 2n — 1 phases for the Dirac 
case, only n such phases can be removed (via redefinitions of the charged lepton 
fields) for the Majorana mixing matrix. The reason is that Majorana fields are self- 
conjugate (cf. Sect. I-3) and thus not subject to phase redefinitions.* Thus, the 
number of remaining phases in the Majorana mixing matrix is n(n — 1)/2. For 
n = 3 there are three phases, of which one is identified as the phase 6 in the Dirac 
mixing matrix and two others, œ1, @2, are commonly called Majorana phases. It 
can be shown that 


1 0 0 
U=VP, with P,=|0 e@? 0 |, (2.10) 
0 0 ae 


where the {œ;} are the Majorana phases. For convenience, we give the neutrino 
mixing matrix V”, 


—iô 


C12C13 S12C13 513€ 
(vy) __ iô id 
V™ = | —sy2003 — c1253513e — C12€23 — S12823S13¢' 593€13 |, (2.11) 
i5 i5 
$2823 — C12023813€'° —8S23C12 — $12C23813€'°  C23C13 


where Sag = Sin Og, Cag = COS Ogg (a, B = 1,2,3). 


4 Ify — ef? w, then (W) > e7!9 (h). Maintaining the Majorana condition y = y! occurs only for 
On = nz, so 6 cannot be arbitrary. 
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VI-3 Theory of neutrino oscillations 


Our current information on neutrino mass and mixing comes via the phenomenon 
of neutrino oscillations. We review the foundation of this subject in the present 
section. 


Oscillations in vacuum 


Suppose that at time ¢ = 0 an electron neutrino is produced by a weak process 
induced by the charged current J4 (e) of Eq. (2.3) and thereafter propagates as an 
eigenstate of momentum p, 


[ve(0)) > |ve(t)) = US lve F + US lve” + Uge, B.D 


where E; = (p? + m?)!/”. In this relation, the mixing matrix elements {Usg} (k = 
1,2,3) appear as complex conjugates because the neutrino field in the charged 
current is in the form of a Hermitian conjugate ve. Actually, as written Eq. (3.1) 
is theoretically tainted because the superposition cannot be a simultaneous eigen- 
state of both momentum and energy since mı #4 m # m3. However, since this 
simplified description leads to the correct oscillation phase under rather general 
conditions, we continue to use it here.° 

To proceed, we take p >> m;, implying that E; ~ p+m?/(2p) = p+m?/(2E). 
Upon replacing the time by the distance traveled, t ~ L, we obtain from Eq. (3.1), 


2_ „2 
Ive(L)) = etBt (uzm) + U% lv) exp ir] 


m2 — m? 
U: —i ———L | }. 3.2 
MAE a) 82 
Let us now truncate the description to just two neutrino flavors by working in the 
small 6;3 limit, evidently a reasonable approximation given that |U.3/U.\| = 0.16. 

Then, the amplitude A,,», (L) and probability P,,,,,(L) for remaining in the orig- 
inal weak eigenstate v,(0) at distance L become 


. a 
Any, EL) = iO) Ly =e (ay + [Ueal? exp ee) 

2 4 4 á 2 Am}, 
Pov (L) = |Ay,v, (LE)? = ch + Stn + 2ci,57, cos a : (3.3) 


5 Two recent discussions of this point appear in [KaKRV 10] and [CoGL 09], but many others have contributed 
to the topic. See references cited in [GiK 07] and [RPP 12]. 
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With a bit of algebra, we then obtain for the survival and transition probabilities 
Py,»,(L) and P,,», (L), 


Am3,L 
Pov (L) = 1 — sin? 20, sin? |, 
ove (L) sinf 2012 sin | 


25.9) a) Am, L 
Py.v, (L) = sin” 2012 sin AE : (3.4a) 


Let us comment on aspects of these important relations. The amplitude of the 
oscillation factor is sin? 2612. The oscillation phase ®2; = Am L / (4E) informs 
about the squared mass-difference Am3}, given that the energy (E) and distance 
(L) are dictated by constraints of Nature and/or by experimental design. An expres- 
sion useful for numerical work is 


Am3,[eV7] Lim] 


a > 1.267 3.4b 
21 E[MeV] (3.4b) 
Another involves defining an oscillation length LC), 
1 
sin? ©); = 50 — cos [2rL/L2?]), 
4m E E[MeV 
LOD = ZE 20m] ~ 2.48 An (3.4c) 

Am5; i Am3,[eV~] 


which is the length for obtaining a half-cycle of oscillation. If conditions are such 
that 27L « L@), oscillations will not have had a chance to occur because the 
oscillation phase is too small. Finally, we stress that Eq. (3.4a) is a result of the two- 
flavor restriction. Although ‘three-flavor’ phenomenology was already advocated 
shortly after the discovery of the t lepton [DeLMPP 80] and is currently used 
in precise analyses of neutrino data, e.g., [FoTV 12], it can happen that the two- 
flavor approach is a valid approximation in certain circumstances (see Prob. 2 at 
the end of this chapter). For example, it is often used to describe both solar mixing 
(02 > 0o, Am}, => Am) and atmospheric mixing (623 —> 6a, |Am3,| — 
[Am2 |). 

We have been considering the vacuum propagation of neutrinos. The vacuum 
evolution equation for the relativistic energy eigenstates v; and v2 as expressed in 
the energy basis is idvg/dx = Hgvg, where 


m? 
vg = A and H; = Ey 0 > [28 = : (3.5) 
v2 0 E 0 a 


6 As will be discussed in Sect. VI-4, the predicted oscillation pattern of Eq. (3.4a) was first observed in 2002 
(for electron antineutrinos) by the KamLAND collaboration. 


VI-3 Theory of neutrino oscillations 181 


The right-most matrix form in Eq. (3.5) has been obtained by expanding the energy 
in powers of the momentum, followed by the phase transformation vg — 
exp(—ipx)v,. Proceeding to the weak basis vy, 


ws (x) =U» and U= ( gast Sidis J (3.6a) 


u — sin ĝı2 cosd2 
the evolution equation can be written idvw/dx = Hwvw, where 


2 2 
Am5; Am5] 


— cos 20 sin 20 
Hw =UHU' =| 4 z E aa (3.6b) 
t sin 26> ao cos 2012 


As shown earlier in this section, the evolution in Eq. (3.6b) describes ve < vy, 
vacuum oscillations. Using the current PDG value for 612 (see Eq. (II-4.24)), we 
have from Eq. (3.6a) the numerical expressions 


Ivi} = 0.83|v.) —0.56|v,),  [v2} = 0.56]v.) + 0.83]v,,). (3.6c) 


The dominant component of |v2) resides in |v,,), a fact we will refer to in the next 
section. 


Oscillations in matter: MSW effect 


Neutrino propagation in matter is a problem of intrinsic theoretical interest. It is 
also of practical importance because many past and present experiments involve, 
in part, neutrinos traveling in the interiors of the Sun and of the Earth. In the fol- 
lowing, we consider a neutrino moving radially with position coordinate r and 
continue to employ the two-flavor description. 

For neutrino propagation in matter, a key difference with the vacuum descrip- 


tion is that the neutrinos will undergo W* and Z° exchange scattering from atomic 
electrons and quarks confined within protons and neutrons. Only elastic scattering 
in the forward direction maintains the coherence of the initial mixed v,—v,, state. In 


particular, the quark contributions cancel and it is W= exchange in the v-e inter- 
action which produces a potential difference between electron and muon neutrinos, 


AV = V (ve) — Vv) = V2GFN,(r), (3.7) 


where N,(r) is the electron number density at distance r from the origin. That 
neutrinos in matter experience this potential energy was first pointed out by 
Wolfenstein [Wo 78], who cited a well-known analogous effect in K°—K° mixing 
as neutral kaons move through nuclear matter. To properly account for the 
Wolfenstein effect, we must alter the diagonal matrix elements in Eq. (3.6b) to 
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A 2 2 
{1GrNe(r) — 21 cos 2612 Mi sin 2012 
H™ (r) = 4E 
Ami) 1 Am}, 
sin 2812 — [1G rN) + JE cos 2012 


(3.8) 


where the superscript in H™ refers to ‘matter’. Since the electron number 
density is generally spatially dependent, the above matrix H™ will have spatially 


dependent eigenvalues E.(r), 


1 aa 2 Am, 2 1/2 
Ex@)=4; (mi o) + (2 sin20.) , (3.9) 


In the discussion to follow, we shall consider neutrino propagation in the Sun. 


At the solar core r = 0, the potential energy of Eq. (3.7) becomes AV?) 
7.6 x 10-!? eV upon taking /2Gr ~ 7.63 x 10-'4eV-cm?/N,4 and N&r) 
100N, cm~? ~ 6.0 x 107 cm~3. Let us next make two working hypotheses: 


(1) We assume that the electron number density N‘°° is sufficiently large 
(M) 


Ie [2 


to 


ensure that Hj; (0) > 0 at the core. Using the value of {°° just given 
above and adopting the current PDG values for Am}, and 6), this will be 
valid for neutrinos with energy above E ~ 2 MeV. This energy is, however, 
not precisely fixed since the core is a region and not just a point. 

If indeed HAP is positive at the solar core, it becomes negative before reach- 
ing the surface (since N, vanishes at the surface) and vice versa for HY”. The 
matrix elements HAP and HSP thus cross at the point where each vanishes. 
In the limit of neglecting the off-diagonals of H™, the diagonals become 
the eigenvalues and we have the phenomenon of level crossing, familiar from 
atomic and nuclear physics. In reality, the off-diagonals do not vanish and so 
the level crossing is avoided. 


(2) We assume that propagation of an electron neutrino in the solar matter is 


adiabatic, i.e. the fractional change in the electron density of the matter is 
small per neutrino oscillation cycle. If so, a neutrino that starts in one of the 
energy eigenstates will not experience a transition as it passes through the solar 
medium. This is akin to a particle in an eigenstate of the one-dimensional infi- 
nite well maintaining its quantum state as the wall separation changes suffi- 
ciently slowly. 


Let us now follow the behavior of E..(r) from the solar core at r = 0 to the solar 
surface at r = Ro. As we move outward from the core, N,(r) will decrease’ until 


a point r = Fres is reached at which HAP (Tres) = HS” (Fies) = 0, with 


7 A popular model for the number density profile is Ne(r) = Ne(O)e~"/"0 with ro > Ro/10. 
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Table VI-1. Evolution of |v''(r)). 


r A(r) |uM(r)) 
0 m/2 |Ve) 
Ties x/4 (Ive) + Iva)) / V2 
Ro 612 sin 612|Ve) + cos 012| V4) 
2 
Nees _ Am} Cos 2012 (3.10) 


2/2G rE 
after which E,(r) grows until the surface is reached. Similarly, E_ will increase 
from r = 0 tor = Fres and decrease thereafter. The label res used here stands for 
‘resonance’, as will be explained shortly. 

The eigenstates |vM(r)) of the matrix H™ are likewise spatially dependent, 


|uM(r)) = cos 0 (r)|ve) + sinO(r)|v,z), 
|vy"(r)) = — sin 0 (r)|ve) + cos O(r)|v,), (3.11) 
as is also the associated mixing angle 0 (r) which, after some algebra, can be written 


as 


in 20 
sin 20(r) = ae 


- z (3.12) 
| (ween /N& — 1) cos? 2619 + sin? 20 


The square of this relation has the profile of a Lorentzian distribution, indicating 
the presence of a resonance [MiS 85]. 

Suppose an electron neutrino ve is created at the solar core r = 0 under the 
assumption N, (0) >> NE. Its evolution to the solar surface r = Rọ is briefly 
summarized in Table VI-1 and explained as follows. The condition N, (0) > N res) 
implies from Eq. (3.12) that 6 (0) ~ 2/2, and so from Eq. (3.11) that [jo eS |) 
at the core. Thus, a newly created electron neutrino will reside in the energy eigen- 
state |v}") as it begins its journey to the solar surface. If the matter eigenstates 
undergo adiabatic flow through the resonance, then |v}") suffers no transitions. 
As the surface is eventually approached, the electron number density decreases to 
zero, N.(Ro) = 0 and so, from Eq. (3.12), 0 (r) > 0 (Ro) & O10, the vacuum mix- 
ing angle. What is new and exciting is that electron neutrinos of sufficiently high 
energies, which are created by nuclear reactions at the solar core, have an appre- 
ciable probability for conversion to muon neutrinos by the time the solar surface is 
reached. This is, in essence, the phenomenon known as the Mikheyev—Smirnov— 
Wolfenstein (MSW) effect [Wo 78, MiS 85]. The mixing between electron and 
muon type neutrinos has occurred within the Sun and since |v2) is an energy 
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eigenstate, no further mixing occurs en route to Earth. Measurement at a detec- 
tor on Earth will yield either ve or v, according to the quantum state |v2) of 
Eq. (3.6c). 

Not all neutrinos created in the solar core will experience MSW mixing. As 
shown earlier, it may be that the neutrino energy is too small (roughly E,, < 
2 MeV) for level crossing to take place. Or the neutrino flow to the solar surface 
may not be adiabatic. The quantitative condition for adiabaticity is most stringent 
at the resonant point r = Tres, 

sin? 201. Am}, | NE® 

cos26;2 2E | Ni (res) 
where N?(r;es) is the density gradient, N’ = dN./dr, evaluated at the resonant 
point. Thus, adiabaticity will occur provided the solar electron number density 
does not change too rapidly with position. The relation in Eq. (3.13) amounts to 
demanding that the splitting between the energy eigenvalues E+ (r) of H™ (cf. 
Eq. (3.9)), which is minimal at the resonant point, nonetheless be much larger 
than the off-diagonal matrix elements of H™ (which would produce transitions 
between the energy eigenstates). We return to this subject in Sect. VI-4, where we 
further discuss solar neutrinos. 


>1, (3.13) 


CP violation 


The CP-violating phase in the PMNS matrix has physical implications in neutrino 
oscillations, relating the oscillations of neutrinos to those of antineutrinos. It is 
reasonably straightforward to use the general form of the oscillation formula to 
calculate the difference of the oscillation probabilities, 


= * *) of Amy L 
Ajj = Priv; = Pi; >; = 4 $ UaU UnU sin ( 7E ) š (3.14) 
It is less straightforward to measure this. We note that A;; vanishes unless all three 
flavors of neutrinos are involved. This can be found from direct calculation but is 
easy to understand on general principles, as the CP-violating phase can be removed 
from any 2 x 2 submatrix by redefining the fields. Moreover, the numerator in this 
asymmetry is the same for any i Æ j, 


Aj; = sin ô cos 013 sin 2013 sin 2023 sin 2012 


. [Am L iil Amî, L ou Am3,L (3.15) 
x | sin {| ———— sin sin | ———— . 
2E 2E 2E , 
where we have used Am, = —Amz},. We see that two independent mass dif- 


ferences, e.g., Am}, and Am?;, contribute. In addition, the asymmetry will pro- 
duce small corrections to the oscillations with the largest amplitudes and will 
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be most visible for oscillations where the CP-even transitions are the smallest, 
such as ve <> v,. Although uncovering CP violation in oscillations will be an 
experimental challenge, the rewards of such a measurement will be considerable. 
For example, lepton CP violation is a necessary ingredient for leptogenesis (cf. 
Sect. VI-6). 


VI-4 Neutrino phenomenology 


Determination of the set of mixing parameters {6;;} and {Am} has taken years of 
careful experimentation. This has involved a variety of neutrino sources, including 
our Sun, the Earth’s atmosphere, nuclear reactors and particle accelerators. Many 
references and detailed accounts exist in the literature.’ 


Solar and reactor neutrinos: 012 and Am2}; 


Solar neutrinos: The current evaluations [RPP 12] of the parameters sin? 26)> and 
Am}, from a three-neutrino fit give 


sin? 261. = 0.857 + 0.024 Ams, = (7.50 +0.20) x 10> eV?. (4.1) 


This represents an uncertainty of under 3%, which is one indication of how suc- 
cessful the search for these basic parameters has turned out. The earliest progress 
in this area involved the detection of solar neutrinos. It was Davis [Da 64] who used 
a chlorine detector to probe solar neutrinos and Bahcall [Ba 64] who provided the 
theoretical basis for such an ambitious undertaking. An important conceptual con- 
tribution came from Pontecorvo, who suggested testing whether leptonic charge 
was conserved, and who wrote ‘From the point of view of detection possibilities, 
an ideal object is the Sun’ [Po 68]. 

The initial intent of the chlorine experiment was actually to test physics at the 
core of the Sun. A significant achievement of solar neutrino studies has been to 
demonstrate that stars are, indeed, powered by nuclear fusion reactions. Energy 
produced by the Sun arises from thermonuclear reactions in the solar core and the 
underlying theoretical description is called the Standard Solar Model (SSM). Solar 
burning utilizes all three types of Standard Model reactions — strong, weak, and 
electromagnetic — as well as using gravity to provide the required high density. 
The primary ingredients of the SSM are: 


(1) The Sun evolves in hydrostatic equilibrium, balancing the gravitational force 
and the pressure gradient. The equation of state is specified as a function of 
temperature, density, and solar composition. 


8 Some recent examples include [AnMPS 12], [Ba 90], [BaH 13], [FoTV 12], [FoLMMPR 12], [GiK 07], 
[GoMSS 12], [HaRS 12], [KiL 13], [MoA et al. 07], and Chap. XIII in [RPP 12] by Nakamura and Petkov. 
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(2) Energy proceeds through the solar medium by radiation and convection. While 
the solar envelope is convective, radiative transport dominates the core region 
where the thermonuclear reactions take place. 

(3) The primary thermonuclear chain involves the conversion 4p — *He+2et + 
2ve. This pp chain produces 26.7 MeV per cycle, and the associated neutrino 
production rate is firmly tied to the amount of energy production. The core 
temperature and electron number density of the Sun are respectively T, ~ 
1.5 x 10’ K and N, ~ 6 x 10% cm7?. 

(4) The model is constrained to produce the observed solar radius, mass and lumi- 
nosity. The initial *He/H ratio is adjusted to reproduce the luminosity at the 
Sun’s current 4.57 Giga-year age. 


The dominant process is the ‘pp chain’, occurring in stages I > IV: 


Stage Nuclear reaction Br (%) 
I ptp > *H+et+v, 99.75 
ptetp > 7H+», 0.25 
Il 2H +p —> 7*He+y 100.00 
Il 3He +°He —> *He+2p 86.00 
or 3He +*He —> 7Be+y 14.00 
IV Bete” > JLit+v. 99.89 
Li+ p —> *+*He +t He 
or TBe+ p > ®B+y 0.11 


8B — 8Be*+e++4 »v, 


Let us isolate those processes which produce neutrinos and order them according 
to increasing maximum neutrino energy: 


Label Reaction E ax (MeV) 
pp ptp > *H+et+ve 0.42 
7Be TBe +e > Li+ w 0.86 
pep p+e+p > 7H+», 1.44 
8B 8B +> 8Be*+et++ ve 14.06 


hep *He+p > *H+et4+vr, 18.47 


The energy spectra of the pp, 8B and hep neutrinos are continuous whereas the 
pep and ‘Be neutrinos are monoenergetic. Within this general framework, there 
is, however, still a degree of theoretical uncertainty and work continues to this 
day on solar modeling. Table VI-2 (taken from [HaRS 12] and [AnMPS 12]) lists 
SSN flux predictions according to two sets, labelled GS98 and AGSSO9, and taken 
respectively from [GrS 98] and [AsBFS 09]. Note the marked decrease in flux with 
increasing neutrino energy. 
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Table VI-2. Neutrino flux in the pp chain. 


Label GS98 AGSS09 Solar data 


pp 5.98(1 + 0.006) x 107! 6.03(1 + 0.006) x 107! 6.051490") 

7Be 5.00(1 + 0.07) x 107! 4.56111 +0.07) x 107! 4.821 70-9) x 107! 
pep 1.44(1 + 0.012) x 107!  1.47(1 + 0.012) x 107! — 1.461.919) x 107? 
8B 5.58(1 + 0.13) x 107! 4.59(1 + 0.13) x 107! 5.25(1 + 0.038) x 1074 
hep  8.04(1 +0.30) x 107! — 8.3111 0.30) x107! — 


“Expressed in units of 10!°cm~*s7!. 


On the basis of such flux predictions, results from various solar neutrino exper- 
iments could be compared with the SSM. The following compilation, taken from 
[AnMPS 12], summarizes early results for the ratio of observed-to-predicted elec- 
tron neutrino flux, 


Homestake 0.34+ 0.03, Super-K 0.46 + 0.02, 
SAGE 0.59 + 0.06, Gallex,GNO 0.58 + 0.05. 


We now know that this spread of values arises from the interplay between the range 
of neutrino energies and the influence of the MSW effect. At the time, however, it 
was unclear whether the SSM itself was at fault. The issue was resolved by a series 
of experiments which probed flavor mixing of solar neutrinos while simultaneously 
testing the SSN prediction for the total solar flux. This was carried out by the SNO 
collaboration; for a summary see [Ah et al. (SNO collab.) 11]. Since the SNO 
detector employed heavy water, there was sensitivity to the three reactions: 


charged current (CC): ve +d > p+p+e 
neutral current (NC): v, +d > p+n+v (x =e, U,T) 
elastic scattering (ES): v, +e —> v +e7 (x=e,4,T). (4.2) 


In the Standard Model, only ve contributes to the CC reaction, but all neutrino 
flavors contribute, with equal rates, to the the NC reactions (and also to the ES, 
but with v, having about six times the rate of v, and v,). Early CC measurements 
found f° = (1.76 + 0.11) x 10° cm~? s~}, much less than the (then) predicted 
total flux fj = (5.05 + 0.91) x 10° cm~? s~!. Then, NC measurements obtained 
FAO = (5.09 + 0.62) x 10° cm~? s~', consistent with the f!" prediction. Within 
errors, the only reasonable explanation is that the conversion of ve —> Vu, Vv; must 
be occurring. A more recent determination of the total flux from the 8B reaction 
reduces the uncertainty, 


Di? = (5.25 £0.16 (stat) *$:13 (syst) x 10° cms“, (4.3) 
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consistent with but having smaller uncertainty than the SSN predictions of 
Table VI-2. 

In summary, the versatility of solar neutrino experiments is that they are sensitive 
to various nuclear reactions in the Sun through the measurement of different energy 
neutrinos. The survival probability for electron neutrinos to reach the Earth will 
depend on the neutrino energy E and will in part be affected by the solar MSW 
effect. The survival probability in the two-flavor description can be expressed as 
[Pa 86] 


— 1 1 
Pisi = 2 + G — Prorat cos 20 (0) COS 2612. (4.4a) 


In the above, 6 (0) represents the matter mixing angle at the point of neutrino pro- 
duction (taken here at r = 0), averaging of oscillatory behavior has been carried 
out, and Pron-adbtc describes the nonadiabatic mixing (which is sensitive to the elec- 
tron number density N,(r)) as in Eq. (3.13). 

Let us explore Eq. (4.4a) in the limits of low-energy and high-energy neutrino 
energy, while assuming just adiabatic transitions (Ppon-adbtc = 0). For very low- 
energy neutrinos, as explained previously, there is no MSW resonance and the the 
situation reduces to simple vacuum mixing, 


= 1 
Pie, = = 5 sin? 261) ~ 0.57, (4.4b) 
whereas for very energetic neutrinos, we have 0(0) ~ 2/2 and so 
Pov = sin? 20,2 ~ 0.31. (4.4c) 


For intermediate neutrino energy, the average survival probability interpolates 
smoothly between these two limits. The overall pattern is as depicted in Fig. VI-1. 
The recent experiment [Be et al. (Borexino collab.) 12a] on pep neutrinos, whose 
energy E = 1.44 MeV isat the low end of the spectrum, finds a survival probability 
Piso = 0.62 + 0.17, which is in accord with the above analysis. 

The relations in Eqs. (4.4a—c) pertain to neutrino propagation directly from the 
Sun to the Earth. This is referred to as ‘daytime’ detection, sometimes denoted by 
Fits The ‘nighttime’ probability en 
from passage through the Earth. Letting Rp and Ry represent the day and night 
counting rates, the ‘day-night’ asymmetry, 


would be sensitive to matter effects 


Ry — Rp 


Ap_n = 2—_., 
Ry + Rp 


(4.5) 
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Fig. VI-1 Average survival probability of solar neutrinos vs. neutrino energy. 
Data points represent (from left to right) pp, Be, pep, and 8B neutrinos. 


is an observable which isolates the effect of Earth matter on neutrino propagation. 
This is in distinction to the MSW effect in the Sun, which cannot be turned off. 
Several experiments, the SNO and Super-K experiments (with 8B) and Borexino 
(with 7Be) have studied the day-night effect. For example, the results [Be et al. 
(Borexino collab.) 12b], 


Ap_n = — (0.1 41.2 40.7) % [Borexino] 
= — (40+1.3+0.8) % [Super-K], (4.6) 


are consistent with the theory predictions | Ap_n| < 0.1% (Borexino) and Ap_n ~ 
—3% (Super-K), although the latter is also 2.60 from zero. 

Reactor antineutrinos: The KamLAND experiment was able to observe oscilla- 
tions of v, antineutrinos under laboratory conditions. The ve beam originates from 
nuclear beta decays from several nuclear reactors and detection is obtained via the 
inverse beta-decay process 


Det p—et+n. (4.7) 


In the KamLAND experiment, the average baseline between sources and detec- 
tor is Lo ~ 180 km and the antineutrino energy spectrum covers the approximate 
range 1 < Eş, < 7 MeV. The v, survival formula, as in Eq. (3.4), suggests plotting 
the data as a function of Lo/Es,. The result, shown in Fig. VI-2, clearly exhibits 
the oscillation pattern. This important observation yielded the most accurate deter- 
mination of Am}, at the time and continues to be a significant contributor to the 
current database.” 

9 Since properties of electron antineutrinos are being studied, it is necessary to assume the validity of CPT 


invariance to compare the KamLAND results with those from solar neutrino studies (and any other 
experiment using neutrinos and not antineutrinos). 
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Fig. VI-2 Authors’ representation of the KamLAND observation of neutrino 
oscillations. The curve represents a fit to the oscillation hypothesis. 


Atmospheric and accelerator neutrinos: 023 and |Am?,| 


Since 1996, the Super-Kamiokande experiment has utilized a 50-kiloton Cherenkov 
detector to study oscillations of so-called ‘atmospheric’ muon neutrinos. When 
high-energy cosmic rays strike the Earth’s atmosphere a multitude of secondary 
particles are produced, most of which travel at nearly the speed of light in the same 
direction as the incident cosmic ray. Many of the secondaries are pions and kaons, 
which decay into electrons, muons, and their neutrinos. Using known cross sec- 
tions and decay rates, one expects about twice as many muon neutrinos as electron 
neutrinos from the cosmic-ray events. For example, a 7* decays predominantly as 


mi —> pt +v, > et + vet dy +p, 

i.e., two muon-type neutrinos are produced but only one that is electron-type. 
Detection of these atmospheric neutrinos yielded evidence for oscillations, to wit, 
a deficit of muon-type neutrinos, but no such deficit for the electron neutrinos. 
This has since been augmented with data containing dependence on the azimuthal 
angle (and hence distance from the source) and on the neutrino energy. Because the 
deficit is of just muon neutrinos, the hypothesis is that these oscillations involve the 
conversion v, — vr. Any v, thus generated is not energetic enough to react via 
the charged current to produce a T. 

Accelerator-based efforts to probe the same oscillation parameters include the 
K2K and MINOS experiments. In particular, since 2005 MINOS has studied muon 
neutrinos originating from Fermilab and traveling 735 km through the Earth to 
a detector at the Soudan mine in the state of Minnesota. At Fermilab, an injector 
beam of protons strikes a target, producing copious numbers of pions, whose decay 
is the source of muon neutrinos. 
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Data from both the Super-Kamiokande and MINOS experiments support the 
Vy — Vr Scenario with mixing angle and mass difference given by [RPP 12]: 


sin? 26,3 > 0.95 |Am},| = 0.0023270 mr? eV?. (4.8) 


Moreover, each experiment has also studied muon antineutrino oscillations, 
finding mixing parameters consistent with these values, although less precisely 
determined. 

Finally, the T2K collaboration announcement in 2012 of the first evidence for 
Ve appearance in a v, beam has been confirmed in a recent update [Ab et al. (T2K 
collab.) 13]. The ve appearance probability at oscillation maximum is 


2 a2 ge 2 : 
Pry ve = 4073873593 f + | = 80€73C12€23812813823 ®21 sın ô, (4.9) 


Am, 
where 2; = Am3,L/(4E) and a = 2/2Gen-,E. In particular, the value of 
sin? (20,3) inferred from the data depends on whether a normal or inverted neu- 
trino mass hierarchy is assumed. This can, in turn, be compared to reactor values 
for sin? (20,3). Thus, the importance of this type of experiment lies in its sensitiv- 
ity both to the hierarchy issue and to detection of a CP-violating signal (ô Æ 0). 
Future data from the T2K and NOVA experiments have the potential for significant 
progress in our understanding of neutrino physics. 


Short-baseline studies: 073 


The last of the neutrino oscillation angles to be determined with precision is 013. 
Initial fits to mixing data indicated their smallness. This led to the concern that 
signals of neutrino CP violation, i.e., determination of the CP-violating phase ô, 
might be experimentally inaccessible. For example, recall from Eq. (3.15) that the 
CP-violating asymmetry Ajj; is linear in both sin ô and sin 013. Hence, the attempt 
to measure 6)3 took on a certain urgency. 

Following a growing number of indications that indeed 6;3 4 0, it was sev- 
eral reactor short-baseline experiments which provided the needed precision. A 
key point is that in ve disappearance experiments, with a relatively short baseline 
of roughly 1 km, the influence of sin? (2812) and Am3; on the survival probabil- 
ity pE for electron antineutrinos can safely be neglected. We then have (see 
Prob. VI-2), 


! Am? 
PEM ~ 1 — 2U}? (1 — |U131°) (1 — cos ea) 


= 1 — sin’ (26,3) sin? (1.267Am3,L/E) . (4.10) 
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Based on data from the collaborations Daya Bay [An et al. 12], RENO [Ahn et al. 
12] and Double Chooz [Abe et al. 12], the current RPP listing gives [RPP 12] 


sin? 26,3 = 0.098 + 0.013. (4.11) 


Finally, the future of short-baseline experiments has the potential for additional 
interesting findings. In particular, the inverse relation between L and Am? in the 
neutrino oscillation relations implies that a very short-baseline study (say, with L ~ 
5 — 50 m) would be sensitive to much larger values of squared mass difference 
(say, having order of magnitude Am? ~ 1 eV”) than those observed for Aa, 
and Am3,. Such a large neutrino mass difference evidently occurred in the LSND 
experiment [Ag et al. 01 (LSND collab.)], which found evidence at 3.50 for v, > 
D, oscillations with Am? > 0.2 eV. We shall not discuss this experiment further, 
except to note that, if validated, it would represent effects (e.g., one or more sterile 
neutrinos) beyond the Standard Model. 


VI-5 Testing for the Majorana nature of neutrinos 


In order to determine if the neutrino mass has a Majorana component, one can 
use the fact that Majorana masses violate lepton-number conservation. A sensitive 
measure occurs in the process of neutrinoless double beta decay. There are many 
situations in Nature where one has a nucleus which is kinematically forbidden to 
decay via ordinary beta decay, 


ZA ANA eee te Dis, (5.1) 
but which is allowed to decay via emission of two lepton pairs (2vB£), 
ZA => PAHE fe HiH De. (5.2) 


This 2v process is attributable to the pairing force in nuclei and occurs only 
for even—even nuclei. It is produced at order G4, through the exchange of two W 
bosons. When 2v66 can occur, it is also kinematically possible to have a neutrino- 
less double beta decay (0v£ 6), 


Shans, Or Ag ae. (5.3) 


However, this latter process violates lepton-number conservation by two units and 
would be forbidden if the neutrino possessed a standard Dirac mass. We will see 
that this becomes a sensitive test of the Majorana nature of neutrino mass. 

First, consider 2v88 decay. Because this involves five-body phase space as well 
as two factors of the weak coupling constant Gpr this process is very rare, with 
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Table VI-3. Half-lives of some two-neutrino 
double beta emitters. 


Nucleus Te 

as (2.0 + 0.3 + 0.2) x 10!9 
7Ge (1.7+0.2) x 107! 

136Xe (2.23 + 0.017 + 0.22) x 107! 
Ge (1.7+0.2) x 107! 


lifetimes of order 107° years. Even so, it has been observed in many nuclei, and 
some examples are cited in Table VI-3. The rate for such processes is 


2 F(Z) 
To ~ me Fo (Q/m.) |g Mer — gyMe| - , (5.4) 
| | Ei — (En) — 5Eo 
where F(Z) is a Fermi function, F (Q/m_) is a kinematic factor, 
Read (tetas a (5.5) 
a 2° 9 * 90° 1980)’ l 


and Mr, Mgr are, respectively, the Fermi and Gamow-Teller matrix elements, 
1 ere 1 rene f 
Mr =(fi3 2 li  Mer=(flz 2 n onoi 6.0 
ij ij 


In Eq. (5.4), the closure approximation has been made to represent a sum over inter- 
mediate states via an average excitation energy (E,,). The experimental 2v decay 
rates then determine these matrix elements, which unfortunately are extremely dif- 
ficult to determine theoretically. 

If the neutrino has a Majorana mass component, then neutrinoless double beta 
decay is possible. The basic weak process underlying Ovß£ decay involves the tran- 
sition W` W~ — e` e` through the Feynman diagram of Fig. VI-3. Let us initially 
treat this process by considering only one generation and invoking the mass diagon- 
alization framework of Sect. VI-2. Because the charged weak current couples to 


Fig. VI-3 The basic weak process underlying Ov decay. 
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the combination (cf. Eqs. (1.3), (1.6)) vz = cos@ v, + sin@ v£ , the exchange of 
the two neutrino eigenstates leads to a matrix element which is proportional to 


sin^? 0 ma cos?6 mp 


2 2 2? 
O?-m Q- m; 


(5.7) 


noting that the g portion of the propagator numerator vanishes due to the chiral 
relation r4; = 0. If the Majorana mass term is vanishingly small compared 
to the Dirac mass, this reaction also vanishes since in this case 0 = 7x /4 (so that 
sin = cos@) and mz = —m, = mp. Despite the apparent existence of two 
Majorana fields, the fermion-number violating transition vanishes because the two 
contributions are equal and opposite. !° 

However, if the Majorana mass term does not vanish, the transition can occur. 
Let us consider the case of the seesaw mechanism, in which ma ~ my >> Mw and 
0 ~mp/my X 1. Then the contribution of the first propagator becomes tiny, and 
a nonvanishing transition occurs due to the second propagator. The process is now 
directly proportional to the light Majorana mass my». The momentum flowing in this 
propagator is of order the electron mass, so we can neglect the mass dependence 
m? in the denominator. This leaves the transition proportional to m,/Q°. In this 
scenario the light neutrino acts like a pure Majorana fermion. 

When all three generations of neutrinos can contribute the result depends also on 
the lepton mixing matrix. If one is dealing with Majorana neutrinos, and neglects 
the neutrino mass in the denominator of the neutrino propagator, then the figure of 
merit is the averaged Majorana mass (m,) obtained by summing over all neutrino 
species, 


(m,) = 2 U? mj : (5.8) 


Note that it is the square UU of the PMNS matrix, not the usual combination U tU, 
that enters the reaction. This is because both weak currents lead to e~ emission in 
the final state. It is this feature which allows the Majorana phases œ; 2 to contribute 
to (m,). The decay rate for such a neutrinoless decay has a form analogous to that 
in Eq. (5.4), 


7 m 2i s (miyy 
Pos ~ m, Fo(Q/me)|8aMar — g,Mr| m’ (5.9) 


10 Tf one had chosen to redefine mp to be positive via a phase redefinition, as described in Sect. VI-1, there 
would be an extra phase in the weak current of vp such that the cancelation would still occur due to a factor 
of i? = —1 in the double beta decay matrix element. 


VI-6 Leptogenesis 195 
except now with the phase space factor Fo(x), 


Fo(x) 1+2 pata (5.10) 
x)= x+ — +> +>], 
D E 3° 3 30 


and nuclear matrix elements 


Mp = (fl5 2 ae Mor = (fl5 2, E oi). (5.11) 
The factor of 1/r;; in Eq. (5.11) comes from spatial dependence associated with the 
neutrino propagator in the limit that one neglects the neutrino mass in the denomi- 
nator of Eq. (5.7). 

Neutrinoless double beta decay, Ov, is a topic of considerable theoretical 
importance and is currently under investigation experimentally. As of yet no such 
mode has been observed. Present limits are (m,) < 140 — 380 meV [Ac et al. 
(EXO-200 collab.) 11] and (m,,) < 260 — 540 meV [Ga et al. (KamLAND-ZEN 
collab.) 12]. There are a number of planned experiments which aim to lower these 
bounds. 


VI-6 Leptogenesis 


The material Universe is mostly comprised of matter — protons, neutrons and 
electrons — rather than their antiparticles. The net baryon asymmetry is described 
by the ratio 

ng = Ns- NB 6 x 10-9. (6.1) 

Ny 

Because baryon number and other symmetries are violated in the Standard Model 
and in most of its extensions, it is plausible that this asymmetry was generated 
dynamically in the early Universe. Such a dynamical mechanism requires a process 
which is out of thermal equilibrium and which violates both baryon number and 
CP conservation [Sa 67]. 

If heavy right-handed Majorana neutrinos exist, as allowed by the general mass 
analysis of the Standard Model, they can generate the net baryon asymmetry. The 
basic point is that the heavy Majorana particles can decay differently to leptons 
and antileptons as they fall out of equilibrium in the early Universe through the CP 
violation that is present in the PMNS matrix, and this lepton number asymmetry 
can be reprocessed into a baryon-number asymmetry through the B + L anomaly 
of the Standard Model. 

The decay of Majorana particles need not conserve lepton number, as the Majo- 
rana mass itself violates this quantity. However, to violate CP symmetry requires a 
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rx 


Fig. VI-4 Violating CP symmetry in the lepton sector. 


specific dynamical mechanism. There can be an interference between the phases of 
the PMNS matrix and phases generated by unitarity effects for a given final state. 
To see this, consider the decay diagrams depicted in Fig. VI-4. The tree-level dia- 
grams are proportional to the Yukawa couplings, which are in general complex. 
However, this is not enough, as an overall phase leads to an unobservable effect 
when calculating decay rates. But loop diagrams with on-shell intermediate states, 
like those in Fig. VI-4, pick up extra imaginary parts from on-shell rescattering. 
Computationally, this comes from the ie in Feynman propagators. In addition, loop 
amplitudes have different PMNS phases because they sum over all the particles in 
the loop. Schematically, this is manifest in decay amplitudes as 


idx * 
An,>Hf; = 8ij + > |Lem| e" 8ik8km8mj» 
k,m 


Ay, if; = gi + D [Lim le" ™™ 97, Sim Bij: (6.2) 


km 


where the loop diagram is represented by |L,,,|e'°" with a rescattering phase due 
to on-shell intermediate states 5;,,. The weak phases in the Yukawa couplings g;; 
change sign under the change from particle to antiparticle, but the rescattering 
phase does not. We see that a differential rate develops | Ayn, f; |? — |Ay_, Aj; |? 
Æ 0 through the interference of tree and loop processes and between the different 
components of the loop diagram. 

Producing a net lepton asymmetry would not be sufficient to explain the observed 
matter asymmetry unless some of the leptons could be transformed to baryons. This 
can be accomplished through the baryon anomaly described earlier in Chap. III. 
In the early Universe, with temperatures above the weak scale, processes which 
change baryon number, but conserve B — L, can occur rapidly. This transfers some 
of the initial lepton excess into a net number of baryons. 

The detailed prediction of the amount of baryon production depends on the size 
of the CP-violating phases, as well as the masses of the heavy Majorana particles. 
While a unique set of parameters is not available, in general one needs heavy par- 
ticles of at least 10? GeV in order to reproduce the observed asymmetry. This fits 
well with the observed size of the light neutrino masses, as described earlier in 
Sect. VI-1. 
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VI-7 Number of light neutrino species 


It might seem that the subject of this section, the number of light neutrino species, 
is anon-issue. After all, the very structure of the Standard Model has each charged 
fermion paired with its own neutrino in a weak isospin doublet. Since three charged 
fermions are known to exist, so there must be three neutrinos. Let us, however, view 
this purely as an issue of experimental physics. In particular, data from Z°-decay 
[Sc et al. 06] and the cosmic microwave background (CMB) [Hi et al. (WMAP 
collab.) 13] have been used to obtain independent determinations of the number of 
‘light’ neutrino species N,,. We discuss these two approaches in turn. 


Studies at the Z’ peak 


Since final-state neutrinos are the only Standard Model particles not detected in Z° 
decay, they contribute to the so-called invisible width Tiny. In the Standard Model, 
this is predicted to be 


Lin = Pz — (Lhaa + Dee + Puy + Per) = (497.4 + 2.5) MeV, (7.1) 


where Iz is the total Z? width and Lhaa, Lee, Pan, lr are the hadronic and leptonic 
widths. But is this what is actually found experimentally? 

Several approaches have been explored using the invisible width to determine 
N,, but the one cited here has the advantage of minimizing experimental uncer- 
tainties. The trick is to work with the ratio of measured quantities Piny/T)7, 


Pin Tos 

=N, ; (7.2) 
Wi Ye / sm 
The interpretation of this relation is clear, that the measured invisible width is the 
product of the number of light neutrino species N, and the decay width into a 


single neutrino—antineutrino pair. Using data collected from the collection of LEP 
and SLD experiments,!! one finds [Sc et al. 06] 


N, = 2.984 + 0.008, (7.3) 


which is consistent with the Standard Model value of N, = 3. 


Astrophysical data 


Astrophysics supplies an independent determination of N, which, although cur- 
rently much less precise, is nonetheless of value. The issue of interest to us is 


1 Certain assumptions are made, among them that lepton universality is valid, and that the top-quark and Higgs 
masses are respectively m; = 178.0 GeV and My = 150 GeV. See [Sc et al. 06] for additional discussion. 
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that the CMB has sensitivity, in part, to the neutrino number. Some insight can be 
gained by considering the thermal content of an expanding Universe. We take the 
radiation energy density p, as 


Pr = Py + Pv, (7.4) 


referring respectively to photons (p,,) and relativistic neutrinos (p,). The photon 
and neutrino components obey the well-known thermal relations, 


Py=seT Po = eT SN vs (7.5) 


For a temperature somewhat in excess of 10 MeV, the Universe is pervaded by 
an e~, v, y plasma in thermal equilibrium via the electroweak interactions (so that 
T, = T,). As the temperature drops to about 10 MeV, the expansion rate of the Uni- 
verse starts to exceed the rate of weak interactions, causing the neutrinos to begin 
decoupling from the plasma. Still later, the process of e~ annihilation releases 
entropy to the photons, increasing their temperature relative to the neutrinos. In 
fact, T, = T,- (4/1 1)'° provided the neutrino decoupling is complete by the anni- 
hilation era. Since this is not quite true and to account for any hypothetical ‘extra 
radiation species’ (er), one introduces the effective number of relativistic species 
Nee and writes instead 


mn 7 
7 rT Nei: 7.6 
a age gee ey 


Altogether, the radiation density can be written as 


77 4\43 
Pr = Py l 8 (=) Nett | - (7.7) 


Finally, modern experiments have probed with increasing precision the CMB radi- 
ation density, which reveals conditions at the epoch of photon decoupling (redshift 
z œ 1090). Because of its contribution to p,, Neg affects various properties of 
the CMB [Hi et al. (WMAP collab.) 13], among them the peak locations of the 
baryon acoustic oscillations (BAO). The current fit with minimum uncertainty is 
found from combining data from BAO and CMB measurements [Ad et al. (Planck 
collab.) 13], 


Neg = 3.30 0.27, (7.8) 


consistent with the Standard Model determination Nett = 3.046 [MaMPPPS 05]. 


(1) 


(2) 
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Problems 


Three-generation neutrino mixing 

In three-generation mixing, the flavor (a = e, v, T) and energy (j = 1, 2, 3) 

eigenstates are related by |v,) = Uy la = Ui as in Eq. (3.1). 

(a) Show that the amplitude connecting initial and final flavor states |v} and 
|vg) is Agg = UpjDj Ul given that D is a phase factor (temporarily 


unspecified) describing the neutrino’s propagation. 
(b) Show that the transition probability is Pag = |Aag |? is expressible as 


3 
Pap =J (Up Ual +2 >) Upj Uj UarUZ,Dj Dj. 


j=l j>k 


Hint: Partition the double sum as T = eA +2 ak 
(c) Determine D i Pup. 


(d) Assume that the neutrino propagation factor can be expressed as D; D% = 


—i Am2. L/2E z i : 538 
e` AMLE Where L is the source-detector separation, E is the (relativis- 


tic) neutrino energy and, as in the text, Am? = m? — m5. Then show 


3 2 
Amọ, 
Pag = > [Ugil lUa; + 2 Yen] va L= ers] Unda 


j=) j>k 


where pa: j,k is the (CP-violating phase) of the Uj, factors. 

Two-generation 1 3 neutrino mixing 

The aim is to obtain a simple expression for the survival probability Pee(L) 
for 1 < 3 oscillations starting from the general relation derived above. We 
shall ignore CP-violating effects (and thus set 9 q; jx = 0) and use [Aml < 
|Am3 a |Am3,|, which is already known from the text. Because we wish to 
observe 1 < 3 oscillations, we take |Am3,|L/2E > 1 (Le. 27 L/L} > 1). 
We also take Am},L/2E < 1 (ie. 2mL/LĒD « 1) to suppress 1 < 2 


oscillations. Then show that the survival probability P..(L) can be written as 


Pee(L) =1- Near (1 _ |Ues|”) (1 _ cos(21 L/LSY)) ; 


osc 


Hint: You will want to make use of the unitarity property of the mixing 
matrix U. 


Vil 
Effective field theory for low-energy QCD 


At the lowest possible energies, the Standard Model involves only photons, elec- 
trons, muons, and pions, as these are the lightest particles in the spectrum. As 
we increase the energy slightly, kaons and etas become active. The light pseudo- 
scalar hadrons would be massless Goldstone bosons in the limit that the u, d, s 
quark masses vanished. We give a separate discussion of this portion of the theory 
because it is an important illustration of effective field theory and because it can be 
treated with a higher level of rigor than most other topics. 


VII-1 QCD at low energies 


The SU (2) chiral transformations, 


VLR = (‘) — exp (—iðL R` T)YL,R, (1.1) 
L,R 


almost give rise to an invariance of the QCD lagrangian for small m,, mq, but 
do not appear to induce a left-right symmetry of the particle spectrum. This is 
because the axial symmetry is dynamically broken (i.e. hidden) with the pion being 
the (approximate) Goldstone boson. Vectorial isospin symmetry, i.e., simultaneous 
SU (2) transformations of Yz and wr, remains as an approximate symmetry of the 
spectrum. 

Isospin symmetry is seen from the near equality of masses in the multiplets 
(at, z), (K+, K®), (p,n), etc. In the language of group theory, we say that 
SU (2) x SU (2)gr has been dynamically broken to SU(2)y. What is the evidence 
that such a scenario is correct? Ultimately it comes from the predictions which 
result, such as those which we detail in the remainder of this chapter. 

The effective lagrangian for pions at very low-energy has already been developed 
in Chap. IV. In particular we recall the formalism of Sect. [V-6 which includes 
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couplings to left-handed (right-handed) currents ¢,,(x) (r,,(x)), and scalar and 
pseudoscalar densities s(x) and p(x), with the resulting O(E7) lagrangian, 


F? F? , 
L =F T (D.U D"U") + Ta (xU'+Ux") , 


U =exp(it-a/F;), x =2Bo(s + ip), (1.2) 


where D,,U = ð U +if,U —iUr, and Bo is a constant. QCD in the absence of 
sources is recovered with £, = r, = p = 0 and s = m, where m is the quark mass 
matrix. 


Vacuum expectation values and masses 


With a dynamically broken symmetry, the lagrangian is invariant but the vacuum 
state does not share this symmetry. A useful measure of this noninvariance in QCD 
is the vacuum expectation value of a scalar bilinear, 


(0 |W] 0) = (0 |Y vr] 0) + (0 [pry] 0). (1.3) 
Up to small corrections, isospin symmetry implies 
(0 lžu] 0) = (0 |dd| 0). (1.4) 


Such matrix elements, if nonzero, cannot be invariant under separate left-handed 
or right-handed SU (2) transformations. Indeed, it is evident from Eq. (1.3) that 
the vacuum expectation value couples together the left-handed and right-handed 
sectors. 

One way that the vacuum expectation values of Eq. (1.4) affect phenomenology 
is through the pion mass. If the u and d quarks were massless, the pion would 
be a true Goldstone boson with my = 0. The part of the QCD lagrangian which 
explicitly violates chiral symmetry is the collection of quark mass terms, 


Hinass = —Lmass = Mtu + madd. (1.5) 


To first order in the symmetry breaking, the pion mass is generated by the expec- 
tation value of this hamiltonian, 


m? = (1 |m, itu + madd | Ty. (1.6) 


This quantity can be related to the vacuum expectation value by using the chiral 
lagrangian. Taking both the pion and vacuum matrix elements of Eq. (1.2) and 
using the notation of Sect. IV—6, we have 


j pares 
Bo = -— = (17) 


2 
x 
My + Ma 


f aL 
m2 = (m, + mg) Bo. (0 |¢q| 0) er 


202 Effective field theory for low-energy QCD 


Thus since both Bo and the quark masses are required to be positive, consistency 
requires that (0|¢q|0) be nonzero and negative. However, without a separate deter- 
mination of the quark masses (the origin of which must lie outside chiral symmetry) 
we do not know either (0|¢q|0) or m, + mg independently. 

As an aside, we note that for Goldstone bosons there is a clear answer to the 
perennial question of whether one should treat symmetry breaking in terms of a 
linear or quadratic formula in the meson mass. For states of appreciable mass, the 
two procedures are equivalent to first order in the symmetry breaking since 


6(m’) = (mo + ôm)? — m3 = 2mo Om+---. (1.8) 


However, when the symmetry expansion is about a massless limit, the m vs m? 


distinction becomes important. Because pions are bosonic fields we require their 
effective lagrangian to have the properly normalized form, 


1 
L= 5 (aun on — mim a) +o (1.9) 
The prediction for the pion mass must then have the form, 


m2 = (m, + mq)Bo + (mu + ma)’ Co + (mu — ma) Do ++ >. (1.10) 


In principle, Nature could decide in favor of either m} œ mg or m7, œ mz depend- 


ing on whether the renormalized parameter Bo vanishes or not. However, the choice 
Bo = 0 is not ‘natural’ in that there is no symmetry constraint to force this value. 
Since one generally expects a nonzero value for Bo, the squared pion mass is linear 
in the symmetry-breaking parameter m,. There is every indication that By # 0 in 
QCD. 


Quark mass ratios 


The addition of an extra quark adds to the number of possible hadrons. If the 
strange quark mass is not too large, there are additional low-mass particles associ- 
ated with the breaking of chiral symmetry. Including the quark mass terms, 


m 0 0 
Lmass = Vim + Vemyy , m=|]0 m, Of, (1.11) 
0 0 m 


the QCD lagrangian has an approximate SU (3); x SU(3)r global symmetry. If 
the u, d, s quarks were massless, the dynamical breaking of SU(3); x SU(3)p to 
vector SU (3) would produce eight Goldstone bosons, one for each generator of 
SU (3). These would be the three pions 2+, 2°, four kaons K+, K°, K°, and one 
neutral particle 7g with the quantum numbers of the eighth component of the octet. 
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Due to nonzero quark masses, these mesons are not actually massless, but should 
be light if the quark masses are not ‘too large’. 

What should the K, ng masses be? Unfortunately, QCD is unable to answer 
this question, even if we were able to solve the theory precisely. This is because 
the quark masses are free parameters in QCD, and thus must be determined from 
experiment. This means that the x, K, and ng masses can be used to determine the 
quark masses rather than vice versa. The discussion is somewhat more subtle than 
this simple statement would indicate. Quark masses need to be renormalized, and 
hence to specify their values one has to specify the renormalization prescription and 
the scale at which they are renormalized. Under changes of scale, the mass values 
change, i.e., they ‘run.’ However, quark mass ratios are rather simpler. The QCD 
renormalization is flavor-independent, at least to lowest order in the masses. In this 
situation, mass ratios are independent of the renormalization. There can be some 
residual scheme dependence through higher-order dependence of the renormaliza- 
tion constants on the quark masses. However, to first order, we can be confident 
that the mass ratio determined by the m, K, ng masses is the same ratio as found 
from the mass parameters of the QCD lagrangian. 

The content of chiral SU (3) is contained in an effective lagrangian expressed in 
terms of U = exp[i(A - g)/F] and having the same form as Eq. (1.2). The matrix 
field à - ø contained in U has the explicit representation, 


. yan + g m+ K+ 
1 aa E peed 0) alt: 0 
yo y= T zr tzm K , (1.12) 
a=1 E =i 2 
K K ~ es 


as expressed in terms of the pseudoscalar meson fields. If we choose the parameters 
in Eq. (1.2) to correspond to QCD without external sources, viz., 


s=m, p=0, D,U = 0,U, (1.13) 
the meson masses obtained by expanding to order g? are 
m=, = Bo(my+ma), mke = Bo(ms + my), 
1 
m?o _ Bo(ms +ma), m, = 3 Pom, +M, + ma). (1.14) 
Defining m% = i (m34 + m?o), we obtain from Eq. (1.14) the mass relations, 

m m? 1 (1.15a) 
= ew x LJ 

Ms 2m3 —m2 26 

1 

m? =- (4m% a m) ; (1.15b) 
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where m = (m, + ma)/2. Eq. (1.15a) demonstrates the extreme lightness of the 
u,d quark masses. Most estimates of the strange quark mass place it at around 
ms;(2 GeV) ~ 100 MeV [RPP 12], so that m ~ 4 MeV, i.e., significantly smaller 
than the scale of QCD, Agcp. Of course, the existence of very light quarks in 
the Standard Model is no more (or less) a mystery than is the existence of very 
heavy quarks. Both are determined by the Yukawa couplings of fermions to the 
Higgs boson, which are unconstrained (and not understood) input parameters of 
the theory. In any case, the small values of the u, d masses are responsible in QCD 
for the light pion, and for the usefulness of chiral symmetry techniques. 

The mass relation of Eq. (1.15b) is the Gell-Mann—Okubo formula as applied 
to the octet of Goldstone bosons [GeOR 68]. It predicts m,, = 566 MeV, not 
far from the mass of the 7(549). The small difference between these mass values 
can be accounted for by second-order effects in the mass expansion. In particular, 
mixing of the ng with an SU (3) singlet pseudoscalar produces a mass shift of order 
(ms; — Mñ)’. The difference between the predicted and physical masses is then an 
estimate of accuracy of the lowest-order predictions. 

The use of the full pseudoscalar octet allows us to be sensitive to isospin break- 
ing due to quark mass differences in a way not possible using only pions. This is 
because, to first order, the AJ = 2 mass difference m,+ — m,,0 is independent of 


the AJ = 1 mass difference mg — m,. In contrast, the kaons experience a mass 
splitting of first order in mg — m,,. In particular, the quark mass contribution to the 
kaon mass difference is 


Mq — My 
(m?o — Mics = (mg —m,) Bo = mane (mk = m7) . (1.16) 


— mMm; — mn 


In addition, there are electromagnetic contributions of the form 
2 2 2 2 
(mko — Mgt )om = Mo — Mi +. (1.17) 


This result, called Dashen’s theorem [Da 69], follows in an effective lagrangian 
framework from (i) the vanishing of the electromagnetic self-energies of neutral 
mesons at lowest order in the energy expansion, and (ii) the fact that K* and 2* 
fall in the same U-spin multiplet and hence are treated identically by the elec- 
tromagnetic interaction, itself a U-spin singlet.! By isolating the quark mass and 
electromagnetic contributions to the kaon mass difference, we can write a sum rule, 


Mg — My ( 2 2) _ | Ma — Mu 2 
~ Mk — Mq) = My 
mMm, —m Maqa Mu 
2 


= (mig = m+) — (mo =m), (LAB) 


l Recall that U -spin is the SU (2) subgroup of SU (3) under which the d and s quarks are transformed. 
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which yields 
Mq — My Mq — My 

—— ~ = 0.023, ——— = 0.29. (1.19) 
ms, — m Mq + Mu 

The u quark is seen to be lighter than the d quark, with m,/mg ~ 0.55. The 
reason why this large deviation from unity does not play a major role in low-energy 
physics is that both m, and mg are small compared to the confinement scale of 
QCD. This is, in fact, the origin of isospin symmetry, which in terms of quark 
mass is simply the statement that neither m, nor mq plays a major physical role, 
aside from the crucial fact that mz, 4 0. Why these two masses lie so close to zero 
is a question which the Standard Model does not answer. 


Pion leptonic decay, radiative corrections, and F , 


Throughout our previous discussion of chiral lagrangians, the pion decay constant 
Fx has played an important role. It is defined by the relation 


(0|A/)|x*(p)) = iF, ppd", (1.20) 


where the axial-vector current Al, is expressible in terms of the quark fields 


a 
av (1.21) 
This amplitude gives us the opportunity to display the way that the electroweak 
interactions are matched on to the low-energy strong interactions, and so we treat 
this topic in some detail. 

The pion matrix element is probed experimentally in the decays 7 — ev, and 
T —> jtv,, which are induced by the weak hamiltonian, 


Aj, = Wns 


Gr 
V2 


The decay z+ —> yt v, has invariant amplitude, 


Hw = ~= Vapa (l + ys) Wu [Po vA + ys) We + Poy’ A + ysha]. (1.22) 


GF 2 
Matuto = gaV 2F x Palio “(1 + ys)Up 
z —G r Via Frm, t(l = Ys)Vu> (1.23) 


where the Dirac equation has been used to obtain the second line. An analogous 
expression holds for 7+ —> et v,. We see here the well-known helicity suppression 
phenomenon. That is, the weak interaction current contains the left-handed chiral 
projection operator (1+ ys), which in the massless limit produces only left-handed 
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particles and right-handed antiparticles. However, such a configuration is forbidden 
in the decay of a spin-zero particle to massless jv, or ev. because the leptons 
would be required to have combined angular momentum J, = 1 along the decay 
axis. Thus the amplitudes for 7 — jv,, eve must vanish in the limit m, = 
Me = Q. Since the neutrino is always left-handed, the ut, e+ in pion decay must 
have right-handed helicity to conserve angular momentum. It is helicity flip which 
introduces the factors of m,, me. The decay rate is found to be 


2 
G2 m2 
ay, = Fy Femme lVoal” ( — =) . (1.24) 


However, before using this expression to extract the pion decay constant, one 
must include radiative corrections. We shall do this in some detail because it illus- 
trates the way to match electroweak loops onto hadronic calculations. Since a com- 
plete analysis would be overly lengthy, we present a simplified argument which 
stresses the underlying physics. 

In Chap. V we found that the radiative correction to the muon lifetime is ultravi- 
olet finite even in the approximation of a strictly local weak interaction. However, 
this is not the case for semileptonic transitions, as can be easily demonstrated. 
Consider the photon loop diagrams shown in Fig. VII-1. We divide the photon 
integration into hard and soft components. The former, which determine the ultra- 
violet properties of the diagrams, have short wavelengths à « R, where R is a 
typical hadronic size, and are sensitive to the weak interaction at the quark level. 
In Landau gauge (i.e. §=0), the ultraviolet divergences arising from the wavefunc- 
tion renormalization and vertex renormalization diagrams depicted in Fig. VII-1 
vanish. For example, the vertex term is 


av iG d'k 1 kuk, 
1ER ~ E e003 (-ew + i ) 


/2 (27)4 k? k2 
: koo o- 
x uy Ena H Y5) 73 uzii2y*(1 + ys)uı 
iGrs d'k _ [2kyk n. a ok 
a R 0493 Omni | K tya (1 + ys)u3u2y" (1 + ys)u1, 
(1.25) 
where Q;e is the electric charge of the i" particle. Using 
d'k kyk, v f d'k 1 iguy 
ay ae =e ela (1.26) 
(27r)* k® 4 (Q7)*k* = 327? 
we find that i = 0 as claimed. It is clear, employing a Fierz transforma- 


tion, that photon exchange between particles 4,1 and 2,3 is also ultraviolet-finite. 
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Fig. VII-1 Photonic radiative corrections to the weak quark—quark interaction. 


This result simply represents the nonrenormalization of the vertex of a conserved 
current found in Chap. V. The only ultraviolet divergences then arise from final- 
state and initial-state interactions, i.e., photon exchange between particles 2,4, and 
1,3, for which 


fit. ees d'k 1 kuk, 
Iki = /2 e Q402 (Qm) k2 —8uv + y 


x iy" Ky + ymumiy Ryd + ys)u3 + (2,4 > 1, 3) 


Gr e = = Qa,,A 
ne -Am Q4Q2ln A[u4Yu Yaya + ys)uzüzy"y*y* (d + ys)uı 
— 4ü4y (1 + ys)uzūzy*(1 + ys)ui] + (2,4 > 1,3). (1.27) 


Using the identity in Eq. (C—2.5) for reducing the product of three gamma matrices, 
Eq. (1.27) becomes 


3 
1g? = -MO x 5 (O42 + 030) (A/H). (1.28) 


where M) is the lowest-order vertex. However, the full calculation of the radiative 
corrections must include the propagator for the W boson as well. When the contact 
weak interaction is replaced by the W-exchange diagram and is added to that with 
the photon-exchange replaced by Z-exchange, one obtains a finite result at the 
ultraviolet end with A = mz. The integral is cut off at the lower end at some point 
uL ~ me below which the full hadronic structure must be considered. In the case 
of muon decay we have 


Q.Q,, at QO», Qu = 0. (1.29) 


Thus, as found in Chap. V, there is no divergence. On the other hand, for beta decay 
we obtain 


2 
0.0, + Qr. Qa,s = =a (1.30) 
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We observe that there exists an important difference between the beta-decay effec- 
tive weak coupling (Gg) and the muon-decay coupling (G,,) 


C=, (1+ žm). (1.31) 
T HL 
This hard-photon correction must be added to the soft-photon component, which 
can be found be evaluating the radiative corrections to a structureless (‘point’) pion 
with a high-energy cut-off up. These were calculated long ago with the result 
[Be 58, KiS 59], 


Prt ptv 
a Zia e Bœ) +3 — 6m (1.32) 
earn an My My 
where 
2 1 3 
B(x) =4 ES Tinis 1 inc? —1)—2lnx — F| 
x2 4 
oe. 3 10x? =7 15x? — 21 
+4 T ierre 7+ Inx + —— 
(x? — 1)? A(x? — 1)’ 
(1.33) 


with L(z) = i a In(1 — t) being the Spence function and x = m,/m,,. Adding 
the hard- and soft-photon contributions with yy = ur ~ mp, we find the full 
radiative correction, 


M M 
Pty SPO f ea (2% +3In Z +n = Gia me) | (1.34) 
27 My My 


Mp 
Taking Vaq from Sect. XII-—4 and iy = 3.841 x 10’ s~!, we find 
F, = 92.2 + 0.2 MeV, (1.35) 


where we have appended an uncertainty associated with possible radiative effects 
O(a/2z) that are not included in Eq. (1.34). For chiral symmetry applications in 
this book we shall generally employ the value 


F, ~ 92 MeV. (1.36) 


A clear indication of the importance of radiative corrections can be seen in the 
ratio 


Vx+-etv, 


R= (1.37) 


3 
Prt uto 
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which is strongly suppressed by the helicity mechanism discussed earlier. Appli- 
cation of the lowest-order formula given in Eq. (1.24) leads to a prediction 


2 
(0) mz |m} — m? —4 
R= = — l>] = 1.283 x 1077, (1.38) 
m, (m; — m; 


in disagreement with the measured value 
Rexpt = (1.230 + 0.004) x 10%. (1.39) 


However, when the full radiative correction given in Eq. (1.34) is employed, the 
theoretical prediction is modified to become 


(0) a My —4 
Ray =R (1 -3% m% +...) = (1.2353 + 0.0001) x 10 > (1.40) 


e 


which is consistent with the experimental value. 


VII-—2 Chiral perturbation theory to one loop 


Let us summarize the development thus far. Interactions of the Goldstone bosons 
can be expressed in terms of an effective lagrangian having the correct symme- 
try properties. To lowest order in the energy expansion, i.e., to order E?, it suf- 
fices to use the minimal lagrangian of Eq. (1.2) at tree level. In the SU (2) theory, 
this involves just the known constants F; and my. At the next order, one encoun- 
ters both the general O(E*) lagrangian, given below, and also one-loop diagrams 
[ApB 81, GaL 84, 85a]. The O(E*) lagrangian introduces new parameters, which 
must be determined from experiment. It is also necessary to give a prescription 
which allows one to handle the loop calculations. The general method is described 
in this section. 

The program is called chiral perturbation theory. If one works to order E* in the 
energy expansion, there are typically three ingredients: 


(1) the general lagrangian Lz (of order E?) which is to be used both in loop 
diagrams and at tree level, 


(2) the general lagrangian £4 (of order E*) which is to be used only at tree level, 


(3) the renormalization program which describes how to make physical predic- 
tions at one-loop level. 


The general O(E”) lagrangian has already been given in Eq. (1.2). Now we shall 
turn to the construction of the chiral SU (n) lagrangian to order E+. 
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The order Ef lagrangian 


The O(E*) lagrangian can involve either four-derivative operators or two-derivative 
operators together with one factor of the quark mass term, x ~ 2m, Bo (which 
itself is of order m2 or m7.) or products of two quark mass factors. There are four 
possible chiral-invariant terms with four separate derivatives, 


Tr (D „U D“U'D,UD”UŻ), Tr (D, U D,UÝD”UD”UŻ), 
Tr (D ,U D,UŻ) = Tr (DUO), [Tr (D U Dee). (2.1) 


Other structures, such as 
[Tr (AUD U) Tr (tut DU), (2.2) 
can be expressed in terms of these by using SU (n) matrix identities. 


For the case of SU (3), the operators in Eq. (2.1) are not linearly independent. 
The identities quoted in Eq. (II-2.17) can be used to show that 


(DCD DUD \= S[T (D,UD"U") Ý 


+ Tr (D,UD,U") - Tr (D4UD°U") — 2 Tr (D,UD*U'D,UD'U’"), 
(2.3) 


leaving only three independent operators in this class. In SU (2), a further identity, 
2Tr (D,UD“UtD,UD'U") = [Tr (D U DU’) J’, (2.4) 
leaves us with only two independent O(E*) terms. 
Another conceivable class of operators could have at least two derivatives acting 
on a single chiral matrix, such as 


Tr (D,UD*U") - Tr (UD, D”U). (2.5) 


However, since the E* lagrangian is to be used only at tree level, all states to which 
it is applied obey the equation of motion, 


D" (U'D,,U) + ; (x'U —U'x) =0. (2.6) 


This can be used to eliminate all the double-derivative operators in favor of those 
involving four single derivatives or with factors of x. 
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The remaining operators are reasonably straightforward to determine, and the 
most general O(E*) SU (3) chiral lagrangian is,” 


10 
L= > LO: 
i=l 


= Lı [Tr (D,UD“U")]’ + Lo Tr (D,UD,U") - Tr (D“UD'U") 
+ L; Tr (D,U D“U'D,U D*U') 
+ L4Tr (D,UD“U") Tr (xU* + Ux’) 
+ Ls Tr (DUDU (xUt + Ux*)) + Le [Tr RU FU 
+ Ly [Tr (xtU — Ux*)] + Lg Tr (xUtxUt + Ux'Ux*) 
+ iLg Tr (LyyD"“U D'U! + RyyD“U! D'U) + Lio Tr (LypUR“'U') , 
(2.7) 


where Lav, Ru are the field-strength tensors of external sources given in Eq. (IV— 
6.10). This is a central result of the effective lagrangian approach to the study of 
low-energy strong interactions. Much of the discussion in the chapters to follow 
will concern the above operators and involve a phenomenological determination of 
the {L;}. In chiral SU (2), three operators become redundant. 

For completeness, we note that there may also exist two combinations of the 
external fields, 


Lex = By Tr (Luo L” + Ras R”) + BoTr (x*x), 


which are chirally invariant without involving the matrix U. These do not generate 
any couplings to the Goldstone bosons and hence are not of great phenomenolog- 
ical interest. However, if one were to use the effective lagrangian to describe cor- 
relation functions of the external sources, these two operators can generate contact 
terms. 

Finally, we summarize in Table VII-1 a set of values for the low-energy con- 
stants {L;} as obtained phenomenologically via a global fit to a range of low-energy 
data [BiJ 12]. (In this extraction certain assumptions are made also about the size 
of O(p®) chiral coefficients, since they also contribute to observables.) These con- 
stants provide a characterization of the low-energy dynamics of QCD. 


The renormalization program 


The renormalization procedure is as follows. The lagrangian, £2, when expanded 
in terms of the meson fields, specifies a set of interaction vertices. These can be 


2 We are using the operator basis and notation first set down by Gasser and Leutwyler [GaL 85a]. 
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Table VII-1. Renormalized coefficients in the chiral 
lagrangian L4 given in units of 10-7 and evaluated at 
renormalization point p = m, [BiJ 12]. 


Coefficient Value Origin 

Li 1.12 + 0.20 rr scattering 
L; 2.23 + 0.40 and 

L3 —3.98 + 0.50 Kea decay 
Li 1.50 + 1.01 Fr/Fx 
L3 1.21 + 0.08 Fg/Fx 
Lg 1.17 +0.95 Fg/Fx 
Li —0.36 + 0.18 Meson masses 
Lg 0.62 + 0.16 Fr/Frx 
Lo 7.0 + 0.2 Rare pion 
Lio —5.6+ 0.2 decays 


used to calculate tree-level and one-loop diagrams for any transition of interest. 
This result is added to the contribution which comes from the vertices contained 
in the O(E*) lagrangian £4, treated at tree level only. At this stage, the result con- 
tains both bare parameters and divergent loop integrals. One needs to determine the 
parameters from experiment. The first step involves mass and wavefunction renor- 
malization, as well as renormalization of F». In addition, the parameters entering 
from £4 need to be determined from data. If the lagrangian is indeed the most gen- 
eral one possible, relations between observables will be finite when expressed in 
terms of physical quantities. All the divergences will be absorbed into defining a 
set of renormalized parameters. This fundamental result is demonstrated explicitly 
in App. B-2. 

There exists always an ambiguity of what finite constants should be absorbed 
into the renormalized parameters Ly. This ambiguity does not affect the relation- 
ship between observables, but only influences the numerical values quoted for the 
low-energy constants. Similarly, the regularization procedure for handling diver- 
gent integrals is arbitrary.> We use dimensional regularization and the renormal- 
ization prescription, 


Yi 


L; = Li = 
3272 


L 


2 
E A In(4z) + y — 1 l (2.8) 


3 Care must be taken that the regularization procedure does not destroy the chiral symmetry. Dimensional 
regularization does not cause any problems. When using other regularization schemes, one sometimes needs 
to append an extra contact interaction to maintain chiral invariance [GeJLW 71]. The problem arises due to 
the presence of derivative couplings, which imply that the interaction Hamiltonian is not simply the negative 
of the interaction lagrangian. The contact interaction vanishes in dimensional regularization. 
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where the constants y; are numbers given in Table B—1. When working to O(E*) 
the following procedure is applied. One first computes the relevant vertices from 
La and £4. There are too many possible vertices to make a table of Feynman rules 
practical. In practice, the needed amplitudes are calculated for each application. 
The vertices from £2 are then used in loop diagrams, including mass and wave- 
function renormalizations. The results may be expressed in terms of the renor- 
malized parameters of Eq. (2.8). If these low-energy constants can be determined 
from other processes, one has obtained a well-defined result. Including loops does 
add important physics to the result. The low-energy portion of the loop integrals 
describes the propagation and rescattering of low-energy Goldstone bosons, as 
required by the unitarity of the S matrix. One-loop diagrams add the unitarity cor- 
rections to the lowest-order amplitudes and in addition contain mass contributions 
and other effects from low energy. 

The effective lagrangian may be used in the context either of chiral SU(2) 
or of chiral SU (3). Because SU (2) is a subgroup of SU (3), the general SU (3) 
lagrangian of Eq. (2.7) is also valid for chiral SU (2). However, the SU (2) version 
has fewer low-energy constants, so that only certain combinations of the L; will 
appear in pionic processes. If one is dealing with reactions involving only pions at 
low energy, the kaons and the eta are heavy particles and may be integrated out, 
such that only pionic effects need to be explicitly considered. This procedure pro- 
duces a shift in the values of the low-energy renormalized constants L} such that 
the L; of a purely SU (2) chiral lagrangian and an SU (3) one will differ by a finite 
calculable amount. In this book, we shall use the SU (3) values as our basic param- 
eter set. The SU (2) coefficients can be found by first performing calculations in 
the SU (3) limit and then treating m%, m? as large. Equivalently, all may be calcu- 
lated at the same time using the background field method [GaL 85a]. The results 
are 


r r r r 1 y r l 
2L” + LẸ = 214 + L; — 3ex, Ly” = L3- 3k, 
r r r r 3 r r 
2G + LEY = 204+ L5 — Sek, Ly” = L4 — ex, 
r r r r 3 l k i 
2L” + Ly” =2L6 + L4- Fle — Tl Lw = Lin + tk, 


3 
LOLO a 3L Sa a a Ol a zi 


F? 5 m? 
24m? tirme r 


(2.9) 


where we use the superscript (2) to indicate constants in the SU (2) theory and 
define 4; = [In(m? ieee 1] /384s*. In practice, these shifts are much smaller 
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than the magnitude of the low-energy constants, so that we always simply quote 
the SU (3) value. 

Let us now calculate the mass and wavefunction renormalization constants to 
O(E*) in chiral SU (2). Setting x = (m, +m) Bo = m3, we may expand the basic 
lagrangian as 
2 


Mo 
24 F2 


1 
La = 5 [8 up — mop : p] + (9-9) 


1 
+ Sala") (9 -du9) — 9-9) (p dup)] +O"), (210) 
0 


1 
Ly= = 7/61? BL] = La o- 3” 


a [321 + 16] 5 -m?o -9 + Oy), 


where Fo denotes the value of F, prior to loop corrections. When this lagrangian is 
used in the calculation of the propagator, the terms of O(g*) in £3 will contribute to 
the self-energy via one-loop diagrams, which involve the following d-dimensional 
integrals, 


5 jel (m°) = i Ar j0) = (O|T (x) g,(x)|0), 


dk i snd d T 
2s Aad = 2) 5 
I(m") = bh Onyi 2 —m (anya (1 E 5) (m ) > 


ô jkl (m°) = —3 vi Ar jx (0) = (OIT 9. j(x) A; (x)|0), 


i 


7 d'k m? 
Iom) =u J ay og m = eg (2.11) 


These contributions can be read off from £2 by considering all possible contrac- 
tions among the O(g‘*) terms, and result in the quadratic effective lagrangian, 


1 2 5m tt 
509 9 + i o-o 


1 
fw jtoj o= 
=l P: Pa 12F2 


1 
+ gga (Bind jn — bidu) 10M2) (81d prdu p + Sie 9,0 ) 


T 


l u m3 D a). 1.2 m? D Q) 
+ 53.99 = | 1645 Ler! |- smo oF [3226 + 16L | 


lni D D) Mz 2,4 
= >70"; au [ 1+ (160) +81 Ei ami D| 


1 
= 5m 9 [1+ + (3219 + 16Ly) = a = i I(m J (2.12) 
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To one-loop order, there are no other contributions to the self energy. Observe 
that we have changed mo, Fo into m,, F; in all of the O(E*) corrections, as the 
difference between the two is of yet higher order in the energy expansion. If we 
expand in powers of d — 4 and define the renormalized pion field as g, = Z7'/* 9 
with 


a (219 +19) + m (Ieee ey ee: 
Ch ae — 1 — ln4r +In—}], 
pa a FES JT Amla- u? 
(2.13) 
then the lagrangian assumes the canonical form 
1 i 1 , 
Lett = 5 uP, -0 (o — ax Pr "Q, (2.14) 


Note that, using the definitions of the renormalized parameters, the physical pion 
mass is identified as 
8m? . . m? m? 
2 2 2 x Dr (2)r (2)r (2)r x z 
m? = m? f E pz pi Ar” or | E 5] l 
(2.15) 


The quantity Lef in Eq. (2.14) is the quadratic portion of the one-loop effective 
lagrangian. Since loop effects have already been accounted for, it is to be used 
at tree level. This is a simple application of the background field renormalization 
discussed in App. B-2. 


VII-3 The nature of chiral predictions 


In order to understand how predictions are made in effective field theory as well 
as the range of validity of the energy expansion, let us work out several examples. 
At first, these will seem to be rather obscure processes, but they are the simplest 
hadronic reactions of QCD. As the bosonic interactions of the Goldstone bosons of 
the theory, they are the cleanest processes for demonstrating the dynamical content 
of the symmetries and anomalies of QCD. 


The pion form factor 


The electromagnetic form factor of charged pions is required by Lorentz invariance 
and gauge invariance to have the form* 


(+ (po) [JL] TPD) = G2 (q?) (pi + pr)", (3.1) 


4 The neutral pion form factor is required to vanish by charge conjugation invariance. 


216 Effective field theory for low-energy QCD 


T 


(a) (b) 


Fig. VII-2 Radiative corrections to the pion form factor. 


where q” = (pi — p2)” and G,(0) = 1. The electromagnetic current may be 
identified from the effective lagrangian of Eq. (1.2) by setting £“ = r“ = eQA", 
X = 2Bom, where Q is the quark charge matrix and m is the quark mass matrix. 
To O(E*), we then find 


Be. aL 


= u 
em = eA) (g xð ohli 


1 
— 372? ? F ow) 


(2) 
9 v 
+ 5-9" Og x boh H. 


(3.2) 


2 
m 
+(y x dtp); [1629 $ BrP] a 

The renormalization of this current involves the Feynman diagrams in 
Fig. VII-2. That of Fig. VII-2(a) is simply found using the integral previously 
defined in Eq. (2.11), 


5 
= (p x dtg); I (mZ). (3.3) 


Jem ae 
2 
(2a) ed 


Evaluation of Fig. VII-2(b) is somewhat more complicated. Using the elastic 7 +77 
scattering amplitude given by £5, 


(0* (ky) (k2)|r* (pi) (P2)) 


L 
= zga 2mo+ pi + p3 + ki +k — 3 (pi — ky’), (3.4) 
0 


we compute the vertex amplitude to be 
i d*k 1 1 
3F? J (On) (k+ lq} -m2 +ie (k— lq) — m2 +ie 


x [an+ (e+ 2) + (e- Y-a (r+ oem)! 2k". 


(3.5) 


(JE 2b) = 


Upon integration, most terms drop out because of antisymmetry under k” — —k*, 
and we find 
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2i d*k K#k - (pi + p2) 
2 4 2 2 i 
F3 J Qa) Ņ Figy = m2) ((k zif = m2) 


We can evaluate this integral using dimensional regularization, 


2 M fafi A + p2)*T (1 — $) 
~ F2 (477) 4/2 % 72 > 5 1—d/2 
z Ar) (m2 — q?x (1 — x)) 


ie i 
+q"q-(pit p2) (: + 5) T (2 = 5) (m2 a q?x a =n i 
(3.7) 


(Jla) Qb) = (3.6) 


N 


(JEI Ob) = 


where as usual u is an arbitrary scale introduced in order to maintain the proper 
dimensions. On-shell, we can disregard the term in q“ since q - (pı + pr) = m2 — 
m? = 0. For the remaining piece, we expand about d = 4 to obtain 


l i 2 2 
(Jia 2b) = nF, (pit poy" f dx (m2, — q°x(1 — x)) 
2 m? — q?x(1—x) 
x [( +r- 1- m4r) +m k |: 3.8) 
and the x-integration then yields 
(JE, Jaw = —— (pi + 2)!" | (m2 — 29?) | + - 1 - Inde 
m? 1 q? 1 
In = = (q? — 4m?) H | — |= —q°} 5 3.9 
+n] + (a m3) (5) at} (3.9) 
where 


1 
H(a)= -f dx ln (1 — ax(1 — x)) 
0 


2—2,/4 —1omet/4 — (0<a<4) 


h a 
ba at [h NEN + in0(a — | (otherwise). 


Now we add everything together. The tree-level amplitude is modified by wave- 
function renormalization, 


(3.10) 


(tree) y 2 8m? (2) (2) 
ZG °) = L= a GL, +L; ) 


i E T 
i — 1 — In4x + In — 
24m2 F2 (d4 ” u? 
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BME (572. 72 obs” 
x f+ (LP + LP) t 


2 
ug m 


m2 2 m? TS 2 
=|1 z — 1 —ln4r in , 
taa (rat : x+n) + Fa 4 


(3.11) 
while Figs. VII-2(a,b) contribute as 
5m2 2 m? 
Gz (q? =- z_?_“_+y—1—In4dn+In—}, 
L 4872 F2 | d | 
Gx(q’) 2 cee ee ey 
x = —— zils — == m4r + In— 
Tlo Won? F? 6 J\la=a.* u? 
ae q’ 1 
- (@? — 4m?) H | &— ) - —q?° t}, 3.12 
+U m7) (5) R? | (3.12) 


respectively. Summing Egs. (3.11), (3.12) we see that all terms independent of q? 
cancel, LS becomes pe and the final result is 


or 1 q? m2 q” 
2% o 9 2 2 2 2 T 
G, (q) =1+ m l to | -4m u (L) -4 nme E], 
(3.13) 


The divergences have been absorbed in LS ", while the imaginary part required by 
unitarity is contained in H (q?/m2). Note that the loops also induce a non-power- 
law behavior in G,,(q”). However, numerically this turns out to be small and is 
unobservable in practice. A simple linear approximation, 


(2)r 2 
Gq =14+¢ Fe ig EN Fc. (3.14) 
F2 96n2F2\0 2 


is obtained by Taylor expanding about q? = 0. The corresponding result for chiral 
SU (3) is 


5 2 | 2L9 1 m l1 m, 3 
Gr(q°) =1+4 oe ne et +--+, (3.15) 


The pion form factor is generally parameterized in terms of a charge radius, 
1 
Ga) =1t cigar to. (3.16) 
Thus, for any given value of the energy scale jz, the parameter Lg can be determined 


from the experimental charge radius. From the present experimental value (rž) = 
(0.45 + 0.01) fm?, we obtain Lo(u = mp) = (7.0 0.2) x 107°. 
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The scale u enters the calculation in such a way that, had we used a different 
scale jz’ but kept the physical result invariant, we would have had 


a u? SUQ 
“Re N (SU(2)) 


(3.17) 
1 

B=- in| =) WU). 

7 1287? @ 


In fact, if we look back to the origin of In u in the transition from Eq. (3.7) to 
Eq. (3.8) using 


2 2 
4—-d 2 
—— = a! O(d — 4), 3.18 
eed ny + O( ) (3.18) 
we see that the scale dependence is always tied to the coefficient of the divergence. 
The general result is then 


5 


12 
ret r Vi H 
Li (uw) = Li (u) — 3272 In (£) , 68.19) 


where {y;} are the constants of Table B—1 of App. B, used in the renormalization 
condition of Eq. (2.8). 

This calculation also nicely illustrates the range of validity of the energy expan- 
sion. The pion form factor is well described by a monopole form, 
2 


Gag?) ~ 5 =1+ 24th, (3.20) 


1 
— g?/m? 
with m ~ mp770). The energy expansion is then in powers of g”/m*. At the other 
extreme, the pion form factor can also be treated in QCD when q? is very large 
[BrL 80]. 


Rare processes 


The calculation above is clearly non-predictive as it contains a free parameter, 
Lg, which must be determined phenomenologically. However, predictions do arise 
when more reactions are considered because relations exist between amplitudes as 
a consequence of the underlying chiral symmetry. In particular, there exists a set of 
reactions which are described in terms of two low-energy constants. These pionic 
reactions are shown in Table VII-2. With the additional input of Fg / Fz, the kaonic 
reactions shown there are also predicted. Each case contains hadronic form factors 
5 We have chosen to keep the low-energy constants {L;} dimensionless in the extension to d dimensions. In 


[GaL 85a], the constants have dimension 4174. However, the resulting physics is identical in the limit 
d —> 4. 
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Table VII-2. The radiative complex of pion and 
kaon transitions. 


Pions Kaons 

y > nTn y > K~Kt 

ynt > ynt yKt > yK* 

mE —> € vey KT — e™vey 

a+ + w%ety, K > metyv, 

mt + eTyete Kt > etyete 
05+ 


KT > we’ vey 


which need to be calculated. This section briefly describes the procedure for relat- 
ing such reactions in chiral perturbation theory. All calculations follow the pattern 
described above, so that we shall only quote the results [GaL 85a, DonH 89]. 

In the processes involving photons (7+ > etvey, n" > etv.ete” and ymt > 
yz), there are always Born diagrams where the photon couples to hadrons through 
the known rry coupling. These are shown in Fig. VII-3. In addition, there can 
be direct contact interactions associated with the structure of the pions. These 
introduce new form factors. For the decays tt > etvey, etv ete”, the matrix 
elements are 


eG _ i 
Maytservy = -7 608 81M (P. D*E” A + ys)U(pe)s 


M~yt-retvete- = 


e’Gr 1 
Fa cos 6;M,..(p, ave 
x U(p2)y"v(pi)u(py)yv" + ys)v(pe), (3.21) 


where the hadronic part of the quantity M_,, has the general structure 


Mw (p,q) = | d*x e'1™ (0 |T (JEI O)| rT) 


= V2 Fz L tt (p — g) IA e) +V Fe bur 
(p — q}? — m2 
— ha ((P — Q)udv — 8wa ` (P — 9) = FA llug — Burd?) 
+ thy€pvapq" p’. (3.22) 


The first line represents the tree diagram and in subsequent lines the subscripts V 
and A indicate whether the vector or axial-vector portions of the weak currents are 
involved. The form factor r4 in Eq. (3.22) can only contribute with virtual photons, 
i.e., asin mt — etv,ete. 

The yx* — yz reaction is analyzed in terms of the pion’s electric and mag- 
netic polarizabilities, œg and 6m, which describe the response of the pion to electric 


VII-3 The nature of chiral predictions 221 


Y y y Y Ys X 


(a) (b) (c) 


Fig. VII-3 Tree diagrams for (a) m+ > e™vey, mt —> e™vee™ e", and (b)-(c) 
yut > yn. 


and magnetic fields. In the static limit, electromagnetic fields induce the electric 
and magnetic dipole moments, 
Pe = 4n QEE, My = 478M, (3.23) 
which correspond to an interaction energy 
V = —27 (agE’ + yB’). (3.24) 


These forms emerge in the non-relativistic limit of the general Compton amplitude 


—iTy(P, p', qi) = —i fe eit (x+ (p) |T (Jœ) 0))| 77 (p)) 


(2p'+42),(2p—q), CP +41), 22-4), 
= = 48uv 
(p — qi)’ — m2 (p — Y — m} a 


+o (q2ud1v — Suv 4° q) Ey (3.25) 


where ø is a coefficient proportional to the polarizability and qf, q% are the photon 
momenta, taken as outgoing, with p = p’ + qı + q2. The first three pieces are 
the Born and seagull diagrams. The last contains the extra term which emerges 
from higher-order chiral lagrangians, and the ellipses indicate the presence of other 
possible gauge-invariant structures, which we shall not need. 

The chiral predictions are obtained in the same manner as used for the pion form 
factor. The results at q? ~ 0 are 


Ne = ha 2)r 2 
n= | S00Tas,. Spr (15 PEL ”) , 
"126/27? Fy N= hy — 
2 
rA 2| 7 @r l ma 
— = 32n7| LY” — ln — +1) 1], =0, 
e | f To )| eee 


a 4a Dr Dr a 1 2 


where t = (qı + q2)? and 


4 
Fos- anh? Aa e Se ee. (3.27) 
x b>x 12 
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Table VII-3. Chiral predictions and data in the radiative complex of transitions. 


Reaction Quantity Theory Experiment 
Y > ntn (r2) (fm?) 0.45% 0.45 + 0.01 
y > Kt K-~ (rz) (fm?) 0.45 0.31 + 0.03 
at > etvey hy(mz!) 0.027 0.0254 + 0.0017 
ha/hy 0.4414 0.441 + 0.004 
K+ > etyy (hy +ha)(m;') 0.136 0.133 + 0.008 
at > etwete ra/hy 2.6 2.2+0.3 
ynt > ynt (az + bm) (1074 fm) 0 0.17 + 0.02 
(az — By) (1074 fm) 5.6 13.6 + 2.8 
K > neve E = f-(0)/f+(0) —0.13 —0.17 + 0.02 
à+ (fm?) 0.067 0.0605 + 0.001 
ào (fm?) 0.040 0.0400 + 0.002 


“Used as input. 


The prediction for hy is especially interesting since hy is related by an isospin 
rotation to the amplitude for 7° —> yy (cf. Prob. VII-2). As we will show in 
Sect. VII—6, this is absolutely predicted from the axial anomaly. The presence of 
L'o implies that one of the above measurements must be used to determine it. We 
use the precisely known value for h4/hy to yield 


Lig(u = mp) = —(5.6 + 0.2) x 107°. (3.28) 


The results are compared with experiment in Table VII-3. 

We see that, with one exception, the chiral predictions are in agreement with 
experiment. That exception, the electric polarizability in ynt — ym*, comes 
from two difficult experiments. One uses a pion beam on a heavy Z atom 
[An et al. 85] and the coulomb exchange in mtA — yz7*A is used to pro- 
vide the extra photon (this is called the Primakoff effect). The tree diagram must 
be carefully subtracted off. The second experiment involves the use of a high- 
energy photon beam and the p(y, yz )n reaction and extrapolation to the virtual 
pion pole [Ah et al. 05]. In this case there exist a large number of background 
processes which must be subtracted. We note however that a recent experiment 
[Fr 12] using the Primakoff effect, not yet included in the averages above, obtains 
ap — By = 3.8 + 1.4 + 1.6. Before being concerned with the possible discrepancy 
with the chiral prediction, it would be preferable to have the experimental situation 
clarified. We have also listed the known results on kaonic processes predicted by 
the same constants. The analyses for y > K+K- and K — e*v,y are identical 
to the above results. 
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Pion-pion scattering 


The elastic scattering of two Goldstone bosons is the purest manifestation of the 
chiral effective field theory of QCD. It is aclassic topic with a long history. We use 
it here as an example of the convergence of the perturbative effective field theory 
expansion. 

The scattering of pions can be classified as S-wave, P-wave, D-wave, etc., with 
low partial waves dominating at low-energy. The amplitudes also can be decom- 
posed in overall isospin, Z = 0, 1, 2. Because the pions are spinless bosons, Bose 
symmetry requires that the even partial waves carry Z = 0, 2 and odd angular 
momentum requires J = 1. At low energies, the partial wave amplitude can be 
expanded in terms of a scattering length a} and slope b}, defined by 


ey q? 
Re T} = (4) (arni), (3.29) 


T 


where q? = (s — 4m2) /4. Since the chiral expansion is similarly a power series 
in the energy, a/ and b} provide a useful set of quantities to study. In practice, 
they are extracted from data by using dispersion relations and crossing symmetry 
to extrapolate some of the higher-energy data down to threshold. The only accurate 
very low-energy data are those from K — mzrev,. The experimental values are 
given in Table VII-. 

At lowest order in the energy expansion, the amplitude for zz scattering 
[We 66] can be obtained from £> with the result 


N 


Astu =? =A. (3.30) 


Table VI-4. The pion scattering lengths and slopes. 


Experimental Lowest order? First two orders“ 

a 0.220 0.005 0.16 0.20 

b? 0.25 0.03 0.18 0.26 

a —0.044 + 0.001 —0.045 —0.041 

b2 —0.082 0.008 —0.089 —0.070 

a! 0.038 + 0.002 0.030 0.036 

bt — 0 0.043 

al (17 +3) x 1074 0 20 x 10-4 
a? (1.3 +3) x 1074 0 3.5 x 1074 


“Predictions of chiral symmetry. 
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This produces the scattering lengths and slopes 


a? = Tm} pe m; Aa m3 
0 32m pe” 9 16r F? |! 24r F2’ 
b? = m? b2 2 m? 
0 Am F2’ 0 87 F2’ ee) 


with the numerical values shown in the table. It is remarkable that the lowest- 
energy form of a scattering process may be determined entirely from symmetry 
considerations. At next order in the expansion, one considers loop diagrams and 
the E* lagrangian. The convergence towards the experimental values can be seen 
in the table. Also shown are the best theoretical results which combine dispersive 
constraints with chiral perturbation theory [CoGL 01]. 

The nature of the chiral expansion also becomes evident within this process. 
The lowest-order predictions are real and grow monotonically. As such, they must 
eventually violate the unitarity constraint at some point. The worst case is the J = 
£ = 0 amplitude 


o 1 


rea m7), (3.32) 


which violates the simplest consequence of unitarity, 
s — 4m? 
en En; (3.33) 
s 


below ./s = 700 MeV. In addition, there are no imaginary terms, which must be 
present due to the unitarity constraint, 


L Am2 \ 
Im T{ = (=) cele (3.34) 


These drawbacks are remedied order by order in the energy expansion. Note that 
since ay | starts at order E*, Im T; starts at order E4. When one works to order 
E*, loop diagrams generate an imaginary piece given by Eq. (3.34) with the lowest- 
order predictions for T7 inserted on the right-hand side. This process proceeds 
order by order in the energy expansion. 

We have displayed in Table VII—4 how the O(E“*) predictions modify the scatter- 
ing lengths. Aside from the renormalization of m, and Fy, the corrections depend 
only on the low-energy constants (2L; + L5) and L5. Let us also give a pictorial 
representation of the result. One may see the order E4 improvement and the nature 
of the chiral expansion by considering the Z = 1, = 1 channel, where some of 
the higher-energy data are shown in Fig. VII-4. The resonance structure visible is 
the o (770). The chiral prediction is 
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` 200 400 600 800 1000 


Fig. VU-4 Scattering in the J = 1, £ = | channel. 


2 
i= Saa [ı zd Ci =i = Le) = l (3.35) 
with loops having a negligible effect. The lowest-order result is given by the dashed 
line. It clearly does not reflect the presence of the o(770) resonance. The solid line 
represents the result at order E4 and starts to reproduce the low-energy tail of the 
p(770). It is, of course, impossible to represent a full Breit-Wigner shape by two 
terms in an energy expansion; all orders are required. The chiral predictions at 
O(E*) may reproduce the first two terms, with the resulting expansion being in 
powers of q?/m*,. 


VII-4 The physics behind the QCD chiral lagrangian 


For the most part, we have been using chiral lagrangians as our primary tool for 
making predictions based on the symmetry structure of QCD. In this section, we 
pause to examine which features of QCD are important in determining the structure 
of chiral lagrangians. The general strategy can perhaps be appreciated by a com- 
parison of low-energy and high-energy QCD methodology. At high energies, due 
to the asymptotic freedom of QCD, hard scattering processes can be calculated in 
a power-series expansion in the strong coupling constant. However, some depen- 
dence on ‘soft’ physics remains in the form of structure functions, fragmentation 
functions, etc. These are not calculable perturbatively and must be determined phe- 
nomenologically from the data. At high energy, then, the predictions of QCD are 
relations among amplitudes parameterized in terms of various phenomenological 
structure functions and the strong coupling constant. At very low energies, because 
of the symmetries of QCD, low-energy scatterings and decays can be calculated 
in a power series expansion in the energy. However, some dependence on ‘harder’ 
physics remains in the form of the constants {L‘}. These are not calculable from 
the symmetry structure and must be determined phenomenologically from the data. 
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At low energy, then, the predictions of QCD take the form of relations among 
amplitudes whose structure is based on symmetry constraints but which are para- 
meterized in terms of empirical constants. Nevertheless, QCD should in princi- 
ple also predict the very structure functions and low-energy constants which are 
employed by these techniques. The trouble at present is that we do not have tech- 
niques of comparable rigor with which to calculate these quantities. Nevertheless, 
by using models plus phenomenological insight we can learn a bit about the physics 
which leads to the chiral lagrangian. 

The low-energy constants F, and m, which occur at order E? do not reveal 
much about the structure of the theory. All theories with a slightly broken chi- 
ral SU(2) symmetry will have an identical structure at order E?. The pion decay 
constant F, will be sensitive to the mass scale of the underlying theory, while the 
pion mass my will be determined by the amount of symmetry breaking. However, 
approached phenomenologically, these are basically free parameters and do not 
differentiate between competing theories. 

The situation is different at order E*. Here, the chiral lagrangian contains many 
terms, and the pattern of coefficients is a signature of the underlying theory. The 
linear sigma model without fermions provides us with an example of how one can 
compare a theory with the real world. In Sects. [V—2,4 we calculated the tree-level 
terms in £4 which would be present in the linear sigma model, and obtained a result 
expressible as 


2 


F 1 
211+ L; = 2L4 + Ls =8L6+4Lg=— = —, In791=0. (4.1) 


4m3 8A 


This pattern is quite different from the structure obtained phenomenologically. 
It appears that the linear sigma model is not a good representation of the real 
world. 

Unfortunately, it is harder to theoretically infer the {Z;} directly from QCD. 
However, a look at phenomenology indicates that we should consider the effects 
of vector mesons, in particular the o(770). This is the most clear in the pion form 
factor that shows a dramatic p resonance in the timelike region. Indeed, the whole 
form factor can be well understood in a simple model as being a Breit-Wigner 
shape due to the p resonance 


m2 


Gr(q’) =— 4 , 4.2 
4) =~ m + im, Tq? — 4m?) we 


where the normalization is chosen to enforce the condition G,(0) = 1. This 
works even in the timelike region. Comparison with the chiral lagrangian approach 
implies that this model would predict 
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2 


F 
Lo = r= 12x (0° >. (4.3) 
mgp 


in good agreement with the value obtained earlier, Lọ = (7.0 + 0.2) x 107°. 

This analysis can be extended to Lo. This enters into the Wtzt y vertex, which 
occurs in mt —> e*v,y. Here, both vector and axial-vector mesons can generate 
corrections to the basic couplings. Explicit calculation yields [EcGPR 89] 


F2 F2 
Lio = — 2 = —5.8 x 107. 4.4 
19 4m? 4m7 a aa 


Here, a; refers to the lightest axial-vector meson aı(1260) (cf. Sect. V—3), and 
F, and F, are the couplings of a; and p to the W+ and the photons respectively. 
Again, the result is close to the empirical value cited in Table VII-1, viz. Lig = 
(—5.6 + 0.2) x 107°. 

The phenomenological low-energy constants are scale-dependent, and their 
analysis includes loop effects, while those in Eqs. (4.3), (4.4) are constants, to be 
used at tree level. Nevertheless, there is some sense in comparing them. The effect 
of loops in processes involving Lo, Lio is small, and the scale dependence only 
makes a minor change, Lo(44 = 300 MeV) = 7.7 x 107° vs. Lou = 1 GeV) = 
6.5 x 10-3. Presumably the appropriate scale is near u = mp7. The (770) 
provides a much more important effect here than any other input. 

Finally, it also turns out that the use of vector meson exchange leads to a good 
description of zz scattering [DoRV 89, EcGPR 89]. This is not too surprising in 
light of the need for the chiral lagrangians to reproduce the tail of the o(770), as 
described in Sect. VII-3. As a consequence of crossing symmetry, the po (770) must 
also influence the other scattering channels. To a large extent, the chiral coefficients 
Li, L2, L3 are dominated by the effect of o(770) exchange. We see from these 
examples that phenomenology indicates that the exchange of light vector particles 
is the most important physics effect behind the chiral coefficients which we have 
been discussing. 

The idea that vector mesons play an important dynamical role is not new. It pre- 
dates the Standard Model, originating with Sakurai [Sa 69], in a form called vector 
dominance. The vector dominance idea has never been derived from the Standard 
Model, but nevertheless enjoys considerable phenomenological support. Put most 
broadly, vector dominance states that the main dynamical effect at energies less 
than about 1 GeV is associated with the exchange of vector mesons. The use of a 
chiral lagrangian with parameters described by p-exchange is compatible with this 
idea and puts it on a firmer footing. These considerations suggest that for chiral 
lagrangians the prime ingredient of QCD is the spectrum of the theory. The lin- 
ear sigma model has a quite different spectrum, with a light scalar and no p, and 
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hence does not agree with the data. QCD, however, seems to predict that devia- 
tions from the lowest-order chiral relation must be in such a form as to reproduce 
the low-energy tails of the light resonances, in particular the p. At present, we 
cannot rigorously prove this connection. However, it remains a useful picture in 
estimating various effects of chiral lagrangians. 
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At this stage one must also include the effect of the axial anomaly. The anomaly 
influences not only processes involving photons, such as 7° —> yy, but also purely 
hadronic processes. For example the reaction KK —> a+s~z°, allowed by QCD, 
is not present in any of the chiral lagrangians appearing in previous sections. Its 
absence is easy to understand because the hadronic part of the lagrangian, with 
external fields set equal to zero, has the discrete symmetry g' > —g! (i.e. U < 
U+) which forbids the transition of an even number of mesons to an odd number. 
However this is not a symmetry of QCD. More importantly, there are a set of low- 
energy relations, the Wess—Zumino consistency conditions [WeZ 71], which must 
be satisfied in the presence of the anomaly and which involve hadronic reactions. 
The effect of the anomaly was first analyzed by Wess and Zumino who noted that 
the result could not be expressed as a single local effective lagrangian, and gave 
a Taylor expansion representation for it. Witten [Wi 83a] subsequently gave an 
elegant representation of the Wess—Zumino contribution as an integral over a five- 
dimensional space whose boundary is physical four-dimensional spacetime. 

Since the considerations leading to the Wess—Zumino—Witten action can be 
rather formal, it is best to adopt a direct calculational approach. Fortunately, we 
are able to employ the familiar sigma model (with fermions) because it contains 
the same anomaly structure as QCD. That is, it is the presence of fermions having 
the same quantum numbers as quarks which ensures that the anomaly will occur. 
The absence of gluons in the sigma model is not a problem since, according to the 
Adler—Bardeen theorem [AdB 69], the inclusion of gluons would not modify the 
result. Since the sigma model involves coupling between mesons and fermions, 
we can also observe directly the influence of the anomaly on the Goldstone bosons. 
Although somewhat technically difficult, our approach will clearly illustrate the 
connection with treatments of the anomaly based on perturbative calculations. 

Consider as a starting point the lagrangian, Eq. ([V—1.11), of the linear sigma 
model 


L = wip w — gv (WLU Yr + WRU WL) +. (5.1) 


6 Fora textbook treatment, see [Ge 84]. 
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We have displayed neither the term containing Tr (d,,U0“U") nor any term con- 
taining the scalar field S. Such contributions are not essential to our study of the 
anomaly and will be dropped hereafter. In order to simulate the light quarks of 
QCD, we shall endow each fermion with a color quantum number (letting the num- 
ber Ne of colors be arbitrary) and assume there are three fermion flavors, each of 
constituent mass M = gv. Although the original linear sigma model has a flavor- 
SU (2) chiral symmetry, Eq. (5.1) is equally well defined for flavor SU (3). 
Our analysis begins by imposing on Eq. (5.1) the change of variable 


p=, We=EWe, Ẹ=U, (5.2) 
like that described in App. B—4. This yields 
L= Y" (iP — Mw", Dy = 0, HiV, +iApys: 
Vu =i (GE + E48"), By =—5 EEEE 63) 


For this change of variable the jacobian is not unity, and thus we must write the 
effective action as 


eT — [raved gif tx(Hi OV—M (GU e+ Feu V1) 


~ J [dw dyp I el A VEP- my" 


— eI ot mGD-M) (5.4) 


For large M, it can be shown that the tr In(i P — M) factor does not produce any 
terms at order E* that contain the e””®? dependence characteristic of the anomaly.’ 
Hence, the effect of the anomaly must lie in the jacobian J, and it is this we must 
calculate. 

It is possible to determine the jacobian by integrating a sequence of infinitesimal 
transformations. Thus we introduce the extension £ — &,, 


-th@ 


E, = e'r =expitg, (5.5) 


where T is a continuous parameter and € = &,_;. Transformations induced by the 
infinitesimal parameter ôt will give rise to the infinitesimal quantities £s; and ô J, 


7 This can be verified by expanding as 


tr Inip — M) = tr In(—M(1 —iP/M)) = tr In(—M) — tr (iP)? /2M* +--- 


The first term can be regularized as in the text and directly calculated using the techniques described in 
App. B. The remaining terms vanish for large M. 
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1 1— 
Y = pL + Yr > y= |a TE 4 bi ly. 

J [dy]ldġ] = J [d'Iay'] T, (5.6) 

From Eqs. (M-3.44), (H-3.47), we find 67 to be 
ST 2 emote (Pys) (5.7) 

or 

dinJ kua fo 

7 = —2itr (Q ys). (5.8) 
tT=0 


This result should be familiar from our discussion of the axial anomaly in 
Sect. IU—3. There remain two steps, first to calculate the regularized representa- 
tion of tr (@ ys), and then to integrate with respect to T. 

To regularize the trace, we employ the limiting procedure 


tr (Pys) = lim tr @ys exp[-€P,Pr]) (Dk =a" + iV; +iA, ys), (5.9) 


with A. and ve as in Eq. (5.3), except now constructed from é, and é}. For 
arbitrary t, we make use of the identities 


Ve =V, DV +i [Vi V] i [aA] =O, 


At’ = "A, — OA, +i (Ve A] +i [2ye] =0, (5.10) 
to express), 2, in the form 


PD. = d,d” +o, 
dy = ðu + Vag T OpvA,Y5 = ðu + Try, 
palaga (a, +iVeu) A | ys. (5.11) 


From the heat-kernel expansion of App. B, we have? 
= ae Y5 
tr =1 out X e"an 
(gy5) mi f x Tr (5 : €"a 


4 oe nee n 
Se d'x Tr (pys | +a + |). (5.12) 


Carrying out the “Tr’ operation, which involves some application of Dirac algebra, 
yields 


8 Note the distinction between ‘tr’ and ‘Tr’, as in Eq. (III-3.48). 
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8 j 
Tr (ysPar) = 2i Ne Tr Gas AAT ) fees, (5.13) 


where the ellipses denote contributions not involving €,,g and the factor Ne comes 
from the sum over each fermion color. Combining the above ingredients, we have 
for the regulated action 


enii a 


ae 


—v—a— f 
== dx Tr (? Enap AL A, A.A ae , (5.14) 


where we recall that ọ = E 0/(2F,,). This result expresses the effect of the 
anomaly on the Goldstone bosons. 

Unfortunately, there is no simple way to integrate the entire expression of 
Eq. (5.14) in closed form. In principle, we could represent each of the axial-vector 
currents therein (e.g. A.) as a Taylor series expanded about t = 0 and perform the 
integrations to obtain a series of local lagrangians. Alternatively, however, one can 
simply express Eq. (5.14) as an integral over a five-dimensional space provided we 
identify t with a fifth-coordinate x5 (defined to be timelike). In this case, we use 


gapt = UaU] = — 1", 
_ 1a i 
g= Ut = T (5.15) 


plus the cyclic property of the trace to write 


iNe ipj m 
Twzw(U) = E AX €ijkim tt (LİLİ L*L' L”), (5.16) 
where i,...,m = 5,0,1,2,3 with €! = +1. This is Witten’s form for the 
Wess—Zumino anomaly function. The t = 1 boundary is our physical space- 


time, and the fifth coordinate is just an integration variable. Since each term in the 
Taylor expansion can be integrated, the result depends only on the remaining four 
spacetime variables. Observe that [wzw(U) vanishes for U in SU (2) due to the 
properties of Pauli matrices. For chiral SU (3), the process K+ K~ > mtx is 
the simplest one described by this action and after expanding l wzw, it is given by 
the lagrangian, 


N 
— £ vaß 
— 40n 2F3 E Tr (p9 PO, Pap Og9), (5.17) 


with gp =A -Q. 
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The above discussion has concerned the impact of the anomaly on the Goldstone 
modes. We must also determine its proper form in the presence of photons or W= 
fields. For this purpose, we can obtain the maximal information by generalizing the 
fermion couplings to include arbitrary left-handed or right-handed currents €,,, fu, 


L= ipy — M (PLU Yr + VR YL), 


1 1— 
Debri En (5.18) 
2 2 
The calculation of the jacobian then involves the operator 
=l 1— 
Deni Cpm S 
l; = Elur = i£ ð uér, Tu = Er é = i£ ð éi, (5.19) 


which generalizes Eq. (5.3). It is somewhat painful to work out the full answer 
directly, but fortunately we may invoke Bardeen’s result of Eq. (III—3.64) for the 
general anomaly. Using the identities 


= ETL sks Uw = Ete Ey + Erruvél, 
_ SS ee (5.20) 
Tuv = Err wv,» ay = (lu — Fa) /2, 


where Lav, Fa are given in Eq. (III-3.65), we obtain 


Ne j 4 8 _ FT AA 
rwzw = Age at fa x eur Tr g ~ 3 AHA vat B 
0 


a -L — > 
=F qp Cun bap + Fulop) + 54 ETa F Fuvlog) 


= E pā, Tag +, Taāg + Udi) (5.21) 


Note that the first term corresponds to our previous calculation of Eq. (5.14). 
The WZW anomaly action contains the full influence of the anomalous low energy 
couplings of mesons to themselves and to gauge fields. By construction, it is gauge 
invariant. The t integration can be explicitly performed for all terms but the first in 
Eq. (5.21). However, in the general nonabelian case the result is extremely lengthy 
[PaR 85]. For the simpler but still interesting example of coupling to a photon field 
A,,, the result is 
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Twzw (U, Ay) = Twzw (U) 


aae fax fen Tr (Q (R Ra Rg + LyLaLg)) 


1 
— ie FyyAq Tr (o (Lp + Rp) + 5 (QU' QUR,z + QU ou't,))], 
(5.22) 


+ 


where R, = (0,U")U, L, = U3 U+? 

We have seen here that whereas the anomalous divergence of the axial cur- 
rent represents the response to an infinitesimal anomaly transformation, the WZW 
lagrangian represents the integration of a series of infinitesimal transformations. In 
our analysis of the sigma model, the anomaly has forced the occurrence of certain 
couplings, among them 27° —> yy, y —> 3m and KK —> 3r. As noted earlier, 
although these results are based on an instructional model, the result has the same 
anomaly structure as QCD because the answer must depend on symmetry proper- 
ties alone. Indeed, such conclusions were originally deduced from anomalous Ward 
identities [WeZ 71] without any reference to an underlying model. We regard such 
predictions as among the most profound consequences of the Standard Model. 


VII-6 The axial anomaly and z’ > yy 


The description of pions and photons presented thus far does not include the decay 
x? —> yy. This process is important in QCD, because to understand it one must 
include the anomaly in the axial current. The 2° — yy amplitude has the general 
structure 


M05 yy = —iAyy et et ke ks, (6.1) 
as required by Lorentz invariance, parity conservation, and gauge invariance, and 
leads to the decay rate 


3 
mo 2 
Py syy = zga |Ayy| (6.2) 
From the experimental value, T = 7.74 + 0.37 eV, we find 

Ayy = 0.0252 + 0.0006 Ge V` !. 


We can obtain the lagrangian for 7° —> yy from the WZW action of the previ- 
ous section, restricting the chiral matrices to SU (2). 


9 Witten’s original result did not conserve parity, and this was subsequently corrected [PaR 85, KaRS 84]. 
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c 


La = Ne ewop [eA Tr (QL LaLg + QR, Ra Rg) — ie? FyvAaTg], (6.3) 


487? 
with 
L, = Uð, U', R, =4,U'U, 
, 1 fe 
Tz = Tr (œr + Q’ Rg + zQUQU'Lg + 50U" OUR») (6.4) 


where A, is the photon field, F,» is the photon field strength, and Ne = 3 is the 
number of colors. A crucial aspect of this expression is that it has a known coeffi- 
cient. In this respect, it is unlike other terms in the effective lagrangian, which have 
free parameters that need to be determined phenomenologically. This is because it 
is a prediction of the anomaly structure of QCD. A corollary of this is that £4 must 
not be renormalized by radiative corrections. This was proven at the quark—gluon 
level by Adler and Bardeen [AdB 69]. 

The x? — yy amplitude is found by expanding £4 to first order in the pion 
field, yielding 


aN. 


2 
eNe 
3 Tr (Q°t3) e” F v Agdgn® = 


= vaf 0 
~ 4872 F, Dank, © Tokat , (6-5) 


A 


where we have integrated by parts in the second line. This produces a m? > yy 
matrix element of the form 


= aNe __, 9.0251Gev-!, (6.6) 
3a Fy Ne=3 
in excellent agreement with the experimental value. This is widely recognized as 
an important test of QCD, both as a measurement of the number of colors and also 
as a reflection of the symmetries and anomalies of the theory. It is a remarkable 
result. 

What would have happened if the axial anomaly were not present? The decay 
z? — yy could still occur, but it would be suppressed. The 7° —> yy transition 
must be at least of order E*, as it must involve the dimension-four operator F F. 
The anomaly occurs at this order. However, non-anomalous lagrangians leading 
to this transition can be constructed at order E6. This result was first derived by 
Sutherland and Veltman using a soft-pion technique [Su 67, Ve 67]. 

At what level would we expect corrections to the anomaly prediction for 7° > 
yy? It has been checked that m2 In m2 corrections, which in principle can occur 
when meson loops are present, do not in fact modify the lowest-order result when 
it is expressed in terms of the physical decay constant Fy. However, there are still 
corrections of order m2 /A*, where A is the scale in the energy expansion, which 
amount to modifications of order 2% [GoBH 02]. 
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Problems 


(1) Radiative corrections and x p decay 
To bring Pz ev,/ z+, into agreement with experiment requires a radiative 
correction whose dominant contribution is the so-called seagull component 
(V2Fz8uv) Of Myy (cf. Eq. (3.22)). 
(a) Verify that gauge invariance requires 


iq" M(p, q) = (O|A, 7 |x * (p)), 


and show that the seagull term is required in this regard to cancel the pion 
pole contribution. 

(b) Use the seagull term in Feynman gauge to calculate the radiative correction 
to me decay. Introduce a photon cut-off via 


1 1 =i 
k2 => k2 k= 
so that 
, 2 GF d'k 1 — 
M” = ie Ey, VIr g" 
| Gye a $ 
x UCP) ya (1 + ys) ———_——-y, (pe), 
(Pv) yal D” (Pe) 


and show that 


where 


G _ 
M® = igg VaV ZF pip) + ys) (pe) 


G 
= ~i g VaV Fam (p) C — y5)v(pe) 


is the lowest order amplitude for the z¢2 process. This then is the origin of 
the lepton-mass-dependent radiative correction. 


(2) Radiative pion decay and the anomaly 
Writing the 2° decay amplitude as 


M,0sy) = ie’ el ez” J d xet (0T (VE (x) V™(0)) [7° (p)) 


— + Lk vx ap 
= ie] € Euvapqi P Ayy» 
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and the vector current amplitude in radiative 2* decay as 
V * iqi- =f 
Mert sttuy = tee” / d'xe!™* (O|T(V" (x) V, 0) lrt) 


— +e 
= —iee; Ce wane Pr hy, 


demonstrate that isotopic spin invariance requires /2hy = Ayy. 
(3) Unitarity and the pion form factor 
(a) Verify that the pion form factor given in Eq. (3.13) obeys the strictures of 
unitarity, i.e., 
ime oa [| 22”. 
7 “J (2m)624)245 


x (2x)t8 (pi + pr — qı — q9) (qı — q2)u (0 (Qi) (Qo) |x (pi) (22)) 


where the matrix element is the two-derivative (tree-level) pion—pion scat- 
tering amplitude given in Eq. (3.4). 
(b) How does this result change if the K*K~ intermediate state is added to 
ntn? 
(4) Other worlds 
Describe changes in the macroscopic world if the quark masses were slightly 


different in the following ways: 
(a) m, = m4 = 0, 

(b) m, > ma, 

(c) m, = 0, mg = ms. 


Vill 


Weak interactions of kaons 


The kaon is the lightest hadron having a nonzero strangeness quantum number. It 
is unstable and decays weakly into states with zero strangeness, containing pions, 
photons, and/or leptons. We shall consider decays in the leptonic, semileptonic, 
and hadronic sectors to illustrate aspects of both weak and strong interactions. 


VIII-1 Leptonic and semileptonic processes 
Leptonic decay 


The simplest weak decay of the charged kaon, denoted by the symbol K¢o, is into 
purely leptonic channels K+ > ptv,, K+ — etve. Such decays are character- 
ized by the constant Fx, 


(0 |Sypysu| Kt (k)) =iV2 Frky. (1.1) 


As discussed previously, because of SU (3) breaking Fx is about 20% larger than 
the corresponding pion decay constant F,. As with the pion, but even more so 
because of the larger kaon mass, helicity arguments require strong suppression of 
the electron mode relative to that of the muon. The ratio of etv, to u"v, decay 
rates, as in pion decay, provides a test of lepton universality [RPP 12], 

rt 


Ktsetve 


= (2.488 + 0.012) x 1075, (1.2) 


expt 


Pxtspty, 


in good agreement with the suppression predicted theoretically [CiR 07], 


2: 
r’ a ne. 
An l | mal (1 +8) = (2.477 £0.001) x 1075, (1.3) 


2 fyi? 
Pxt+sutv, m 1 m7, /mk 
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where 6 = —0.04 is the electromagnetic radiative correction including the 
bremsstrahlung component.! The notation T” indicates that the experimenters have 
subtracted off the large structure-dependent components of Kt —> "vey but 
have included the small bremsstrahlung component. 


Kaon beta decay and Vys 


The kaon beta decay reactions Kt —> wlt ve and K? > m+, called K} 
and K?, respectively, also are important in Standard Model physics. They are each 
parameterized by two form factors, 


(17 (p) |5y,u| K°)) = FE (q3 k + Dy + SET (QP k Dy 
Kn. (g2) rys 


k k — . (1.4 
Va (K+ p), + Va ( Pu (1.4) 


(°(p) |Sy u| K*(k)) = 


Isospin invariance implies f£ Om fE ta? = f+. SU (3) symmetry can be invoked 


to relate these matrix elements to the strangeness-conserving transition m" —> 
m?l" vp, resulting in f,(0) = —1 and f_(0) = 0. The deviation of f,(0) from 
unity is predicted to be second order in SU (3) symmetry breaking, i.e., of order 
(ms — Y. This result, the Ademollo—Gatto [AdG 64] theorem, is proved by con- 
sidering the commutation of quark vector charges, 


[o", o™] = QSS, (1.5) 
where 


Q" = / dx G(x) yoqi (x). (1.6) 


Taking matrix elements and inserting a complete set of intermediate states 
gives 


1= D> (lm [o™| Kò- |in [0| Kof). (1.7) 


Finally, we isolate the single m~ state from the sum and note that in the SU (3) 
limit the charge operator can only connect the kaon to another state within the 
same SU (3) multiplet. This implies” 


(n #27 |Q"| K°) =O€), (1.8) 


! The dominant term here is the simple contact contribution —3(a/7r) In(m,,/me) discussed in Sect. VII-1. 
2 This is easiest to obtain in the limit PK > œ. 
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where € is a measure of SU (3) breaking, and we thus conclude that 


1-| oO) = O(), (1.9) 


which is the result we were seeking. 

It is interesting that the SU (2) mass difference m, Æ mg can modify f. s tn? (0) 
in first order despite the Ademollo—Gatto theorem. This can be seen by consid- 
ering a K” in the formulae of Eq. (1.7). Now there exist two intermediate states 
in the same octet as the kaon, i.e. 7° and 7°, and it is their sum which obeys the 
Ademollo—Gatto theorem, 


2 2 
7) + ("| =14+0(e). (1.10) 
In the isospin limit, each term must separately obey the theorem because of the 
isospin relation f i ta? f En, However, when m, 4 mg each form factor in 
Eq. (1.10) can separately deviate from unity to first order in m, — mg as long 
as the first order effect cancels in the sum. Indeed this is what happens, yielding 
(cf. Prob. VIII.1) 


A" © ae 3 (= — Mu 
fE 0) 4 


= ) + Len = 1.021, (1.11) 


Ms — m 


where €x = 0.004 arises from chiral corrections at O(E*) [GaL 85b 85b]. This 
number can also be easily extracted from experiment by using the ratio of K* and 
K° beta decay rates, with the result [CiR 07], 


FEO) 


in agreement with the prediction. 

The prime importance of the Ke3 process is that it provides the best determi- 
nation of the weak mixing element V,,. Because of the Ademollo-Gatto theorem, 
the reaction is protected from large symmetry breaking corrections. In addition, 
the use of chiral perturbation theory allows a reliable treatment of the reaction. The 
above study of the form factors indicates that the theory is under control within the 
limits of experimental precision. The value [LeR 84] 


Vas = 0.2253 + 0.0013 (1.13) 


follows from an analysis of the K° and K* decay rates. 
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VIII-2 The nonleptonic weak interaction 


For leptonic and semileptonic processes, at most one hadronic current is involved. 
There exist also nonleptonic interactions, in which two hadronic charged weak 
currents are coupled by the exchange of W~ gauge bosons, 


2 


g v à a 

Hoi = = d*x DE (x, My)T (J$ (x /2) J3 (—x/2)), 
d 

Jud = (a Et) Vy, +y) |s], (2.1) 
b 


with V being the CKM matrix, given in Eq. (II-4.17). Such interactions are diffi- 
cult to analyze theoretically because the product of two hadronic currents is a com- 
plicated operator. If one imagines inserting a complete set of intermediate states 
between the currents, all states from zero energy to Mw are important, and the 
product is singular at short distances. Thus, one needs to have theoretical control 
over the physics of low-, intermediate-, and high-energy scales in order to make 
reliable predictions. Because this is not the case at present, our predictive power is 
substantially limited. 

Let us first consider the particular case of AS = 1 nonleptonic decays. These 
are governed by the products of currents 


aV"eargs; decry,  dT”tiT,s, (2.2) 
where I’, = y,(1 + ys) and color labels are suppressed. The first of these would 
naively be expected to be the most important, because kaons and pions predomi- 
nantly contain u, d,s quarks. However, the others also contribute through virtual 
effects. Some properties of the AS = 1 nonleptonic interactions can be read off 
from these currents. The first product contains two flavor- SU (3) octet currents, one 
carrying J = 1/2 and one carrying J = 1, 


SUB): B8 8)ymm = 8 © 27, (2.3a) 
isospi ia = ee (2.3b) 
1SOSp1nN : =. = 7 
osp 2729y 


where the symmetric product is taken because the two currents are members of the 
same octet. The singlet SU (3) representation is excluded from Eq. (2.3a) because a 
AS = 1 interaction changes the SU (3) quantum numbers and hence cannot be an 
SU (3) singlet. The other two products in Eq. (2.2) are purely SU (3) octet and 
isospin one-half operators. The currents are also purely left-handed. Thus, the non- 
leptonic hamiltonian transforms under separate left-handed and right-handed chiral 
rotations as (8z, 1r) and (27, 1r). These symmetry properties, valid regardless of 
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the dynamical difficulties occurring in nonleptonic decay, allow one to write down 
effective chiral lagrangians for the nonleptonic kaon decays. The hamiltonian is a 
Lorentz scalar, charge neutral, AS = 1 operator, and has the above specified chiral 
properties. 

At order E”, there exist two possible effective lagrangians for the octet part, viz., 
Loctet = Lg + Lg, where in the notation of Sect. IV-6, 


Lg = gs Tr (àD U D”UŻ), = Lge = Bg Tr (AwxU') +he. (2.4) 


It can easily be checked that both £g and £g are singlets under right-handed trans- 
formations, but transform as members of an octet for the left-handed transforma- 
tions. The barred lagrangian in Eq. (2.4) can in fact be removed, so that it does not 
contribute to physical processes. This is seen in two ways. At the simplest level, 
direct calculation of K —> 27 and K —> 3x amplitudes using Lg, including all 
diagrams, yields a vanishing contribution. Alternatively, this can be understood by 
noting that in QCD the quantity x appearing in Eq. (2.4) is proportional to the 
quark mass matrix, x = 2Bom,. Thus, the effect of Ls is equivalent to a modifica- 
tion of the mass matrix, 


Mg > m} = Mg + 8gà6mq. (2.5) 


This new mass matrix can be diagonalized by a chiral rotation 
Tr (m U) > Tr (Rm LU) = Tr (mpU), (2.6) 


with mp diagonal. The transformed theory clearly has conserved quantum num- 
bers, as it is flavor diagonal. This means that the original theory also has conserved 
quantum numbers, one of which can be called strangeness. When particles are 
mass eigenstates, even in the presence of Ls, the kaon state does not decay. Hence, 
this Lg can be discarded from considerations, leaving only £g as responsible for 
octet K decays [Cr 67]. This octet operator is necessarily AJ = 1/2 in character. 
Another allowed operator, transforming as (27,, 1), contains both AJ = 1/2 and 
AI = 3/2 portions, 


Loy = LY)? + LE, (2.7) 
where 
Ce =e CIE Tr 8, UU) ee, (2.8a) 


LSP = g9 CI Tr 76 UAP 3 U U) he (2.8b) 
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The coefficients are given by 


3/3 
1/2 = 1/2 _ 1/2 _ 
C64i7/2,3 =1, Cy 4i5/2, 1-2/2 = -V2, C64i7/2,8 “Om? 
1 
3/2 T 3/2 _ 
C64i7/2,3 =], Ci4i5/2, 1-i2/2 = Wem (2.9) 


The complete classification at order E* is difficult, but has been obtained 
[KaMW 90]. We shall apply these lagrangians to the data in Sects. VUI-4, XII-6. 
There we shall see that gg >> gb}, whereas naive expectations would have octet 
and 27-plet amplitudes being of comparable strength. This is part of the puzzle of 
the AJ = 1/2 rule. The reliable theoretical calculation of the nonleptonic decay 
amplitudes, which is tantamount to predicting the quantities gg, gy 2 and a a 
is one of the difficult problems mentioned earlier. It has not yet been convincingly 
accomplished. The best we can do is to describe the theoretical framework of the 


short distance expansion, to which we now turn. 


VIII-3 Matching to QCD at short distance 


At short distances, the asymptotic freedom property of QCD allows a perturba- 
tive treatment of the product of currents. The philosophy is to use perturbative 
QCD to treat the strong interactions for energies Mw > E > pm. The result is 
an effective lagrangian which depends on the scale u. Ultimately matrix elements 
must be taken which include the strong interaction below energy scale u and the 
final result should be independent of jz. The subject provides a classic example of 
the techniques of perturbative matching to effective lagrangians and the use of the 
renormalization group. 


Short-distance operator basis 


As introduced in Sect. [V—7, the outcome of the short-distance calculation can be 
expressed as an effective nonleptonic hamiltonian expanded in a set of local oper- 
ators with scale-dependent coefficients (Wilson coefficients) [Wi 69], 


ao Gr., 
Hi! =A Vas 2 CWO; (3.1) 


As in any effective lagrangian, those operators of lowest dimension should be 
dominant. If the operator O; has dimension d, its Wilson coefficient obeys the 


scaling property C; ~ M oe Let us first see how this hamiltonian is generated in 
perturbation theory. We can later use the renormalization group to sum the 
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u,c,t u d d s WwW d 
G u,c,t 
Ww s u u Q Q 
(a) (b) (c) 
Fig. VIN—1 QCD Radiative corrections to the AS = 1 nonleptonic hamiltonian. 


leading logarithmic contributions. The lowest-order diagrams renormalizing the 
current product appear in Fig. VII-1. 

The process in Fig. VII-1(a) corresponds to a left-handed, gauge-invariant 
operator of dimension 4, 


o4 = d P(1 + ys)s. (3.2) 


This operator can be removed from consideration by a redefinition of the quark 
fields (cf. Prob. IV—1). The remaining operators are of dimension 6. Simple W 
exchange with no gluonic corrections gives rise in the short-distance expansion to 
the local operator 


Oa = dy, (1+ ys) uity" (1+ ys)s, (3.3) 


with a coefficient C4 = 2 in the normalization of Eq. (3.1). The gluonic correction 
of Fig. VIII-1(b) generates an operator of the form 


dy, (1 + ys) àfu ity" (1 + ys) A%s, (3.4) 


where the {A“} are color SU (3) matrices. However, use of the Fierz rearrangement 
property (see App. C) and the completeness property Eq. (II-2.8) of SU (3) matri- 
ces allow this to be rewritten in color-singlet form 


= 2 2 
dy, (1 + ys) àu üy" (1 + ys) rts = -304 +208, 
where 
Os = ity, (1+ ys) udy" (1 + ys) 5. (3.5) 


The strong radiative correction is seen to generate a new operator Op. 


Perturbative analysis 


Consider now the one-loop renormalizations of the four-fermion interaction 
Fig. VHI-1(b). In calculating Feynman diagrams we typically encounter integrals 
such as (neglecting quark masses) 
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Mw) Í dk 1 1 i i K? RS (3.6) 
= == n 5 . 
i (n) kt k- M},  16r?M}, K?+M3 |, 


where we evaluate the integral at the lower end using a scale u. Clearly, Mw 
presents a natural cut-off in the sense that 


—i 0 (u ~ Mw), 
LOT : (3.7) 
Sr My |Inu/My (u< My). 
The modification of the matrix element to first order in QCD is then 
2 M2 
EEE E T © G0; = 08. (3.8) 
167 u? 


where g3 is the quark-gluon coupling strength. The gluonic correction to Og must 
also be examined, and a similar analysis yields 


2 M2 
Os > Og — < in| € \ 601 — Op). (3.9) 
167? u? 


We observe that the operators, 


1 
0+ = 5 (O4 + Oz), (3.10) 


are form-invariant, O+ — C+ Ox, with coefficients C+, 


2 2 
§3 Mwy 
Ci =) ie n—~, 3.11 
7 = 16r? p oe 
where d} = —2 and d_ = +4. The isospin content of the various operators can be 


determined in various ways. Perhaps the easiest method involves the use of raising 
and lowering operators [Ca 66], 


Ld=u, lLū=-d, Lu=d, I_d=-ui, (3.12) 


to show that J, O_ = 0, implying that O_ is the J, = 1/2 member of an isospin 
doublet. With repeated use of raising and lowering operators, one can demonstrate 
that O_ is purely AJ = 1/2 whereas Ox is a combination of AJ = 1/2 and 
AI = 3/2 operators. 

From Eq. (3.11), we see that under one-loop corrections the operator O_ is 
enhanced by the factor 


2 2 

83 My 
C_=14+4-—. h —- x 2.1, 3.13 
i 167? = u? [aa 
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where we use a,(u) = 0.4 (Agcp œ 0.2 GeV) at u ~ 1 GeV. Similarly O4 is 
accompanied by the suppression factor 


M2 
la — x 0.4. (3.14) 


C= 1—2 
: 16r? pu? 


Renormalization-group analysis 


Choosing an even smaller value of u would lead to an even larger correction. 
However, maintaining just the lowest-order perturbation in the QCD interaction 
would then be unjustified. It is possible to do better than the lowest-order pertur- 
bative estimate by using the renormalization group to sum the logarithmic factors 
[GaL 74, AIM 74]. In a renormalizable theory physically measurable quantities can 
be written as functions of couplings which are renormalized at a renormalization 
scale ug. Physical quantities calculated in the theory must be independent of up. 
Denoting an arbitrary physical quantity by Q, this may be written 


Q = f (g3(uR), HR), (3.15) 


where f is some function of upg and g3 is the strong coupling constant of QCD. 
Differentiating with respect to ug, we have 


d 
re | (83(uR), Lr) = 9, (3.16) 
HR 


which is the renormalization-group equation. It represents the feature that a change 
in the renormalization scale must be compensated by a modification of the cou- 
pling constants, leaving physical quantities invariant. In order to see how this pro- 
gram can be carried out for the effective weak hamiltonian, consider the following 
irreducible vertex function which represents a typical weak nonleptonic matrix 
element, 


(0|7 (IE Oa Oa Pappa )| 0) 


= (VZ) (0|r (IE ORO Pedi) > B17) 


where the {q;} are quark fields carrying momenta {p;}. Z2 is the quark wavefunc- 
tion renormalization constant for the fermion field, and subscripts ‘ren’, ‘unren’ 
denote renormalized and unrenormalized quantities. 

Choosing the subtraction point p? = —%, we require that unrenormalized 
quantities be independent of upg, 


irr 


ba (o|r (Z Panaa)|o}] =0. (3.18) 


unren 
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This implies 
irr 


0 f] L 
(urz + Poco zg — avr) (0 |r (4) aaas)| 0) =0, 


ren 


where g3, is the renormalized strong coupling constant, Bgcp is the QCD beta 
function of Eq. (I-2.57(b)) and yr is the quark field anomalous dimension of 
Eq. (II-2.69). As we have seen, QCD radiative corrections generally mix the local 
operators appearing in the short-distance expansion, 


(O|T (Ongi92434a)| 0), = J Xan (0 |T (Onr91424344)| O) ren (3.19) 


n 


and the mixing matrix can be diagonalized to obtain a set of multiplicatively renor- 
malized operators 


(OT (Oxq1929344)| 0)", = Z,(O|T (Ox91929394)| 0)" en- (3.20) 
If operator Ox has anomalous dimension yg, we can write 
Zp 1 ayy Inept e 5 (3.21) 


and so the coefficient functions C;(jurx) satisfy 


0 0 
(ues + oco =— + Yk = avr) Ci(urRX) = 0. (3.22) 
OLR 083r 


From the above, we have for the operators O 


2 


Eg 
y+ — 4yr > Tezis (3.23) 


We can solve Eq. (3.22) with methods analogous to those employed in Sect. II-2. 
That is, because QCD is asymptotically free and we are working at large momen- 
tum scales, we can use the perturbative result (cf. Eq. (II-2.57(b))), 


383r 83 
Bocv(83r) = LR TP 6x2 


; (3.24) 


where b = 11 — Snp, ny being the number of quark flavors. Upon inserting the 
leading term in the perturbative expression for a, (cf. Eq. (I-2.74)), 


127 
(33 — 2n p) In uh / A? 


æs(uR) = (3.25) 
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one can verify that the solution to Eq. (3.22) is given by 


ds. /b 
C+ (uR) 83 My 
——— = [1 b In — : 3.26 
C(Mw) N T Tora 8 
Note that in the perturbative regime where a, « 1, we have 
C4 (uR) 83 My 
= f + ds. In =i (3.27) 
C+ (Mw) 167? uR 


which agrees with our previous result, Eq. (3.11). It is the renormalization group 
which has allowed us to sum all the ‘leading logs’. Of course, at scale Mw one 
must be able to reproduce the original weak hamiltonian, implying C+ (Mw) = 
C_(My) = 1. Taking ugr ~ 1 GeV and a, = 0.4 as before, we find 


HA (uR) « Cy (ue) Ox +C- (ur) O-, (3.28) 

with 
C-(uR) 2 1.5, C+ (ur) = 0.8. (3.29) 
We observe then a AZ = 1/2 enhancement of a factor of two or so, which is 


encouraging but still considerably smaller than the experimental value of Ag/A2 ~ 
22 discussed in the next section. 

Two additions to the above analysis must now be addressed. One is the proper 
treatment of heavy-quark thresholds. In reducing the energy scale from Mw down 
to ur, one passes through regions where there are successively six, five, four, or 
three light quarks, the word ‘light’ meaning relative to the energy scale wr. The 
beta function changes slightly from region to region. A proper treatment must apply 
the renormalization group scheme in each sector separately. This is a straightfor- 
ward generalization of the procedures described above. 

The other addition is the inclusion of penguin diagrams of Fig. VII-1(c) 
[ShVZ 77, ShVZ 79b, BiW 84], whimsically named because of a rough resem- 
blance to this antarctic creature. The gluonic penguin is noteworthy because it is 
purely AJ = 1/2, thus helping to build a larger AJ = 1/2 amplitude, and because 
it is the main source of CP violation in the AS = 1 hamiltonian. The electroweak 
penguin, wherein the gluon is replaced by a photon or a Z? boson, also enters 
the theory of CP violation. The CP-conserving portion of the penguin diagrams 
involves a GIM cancelation between the c, u quarks and hence enters significantly 
at scales below the charmed quark mass. On the other hand, in the CP violating 
component, the GIM cancelation is between the ¢, c quarks and thus this piece is 
short-distance dominated. At lowest order, before renormalization-group enhance- 
ment, one obtains the following Hamiltonians for the penguin and electroweak 
penguin interactions (cf. Fig. VIII-2), 
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Fig. VUI-2 Penguin diagram. 


G ; 2 2 

Hens) = — oa Miata In —£ Me >+ Vă Vs ln — Mi m Jana + ys)Msqy' rq, 
2G ra ei m°] - = 

HD = Ji Rae In —> ie > + VigVis In a dy, + ys)dqy" aq. 


(3.30) 


We have used a scale upg instead of the up quark mass and have quoted only the 
logarithmic m, dependence. The quarks q = u,d,s are summed over and Q, 
is the charge of quark q. Note that since the vector current can be written as a 
sum of left-handed and right-handed currents, this is the only place where right- 
handed currents enter Hw. The gluonic penguin contains the right-handed current 
in an SU (3) singlet, hence retaining the (8z, 1g) property of Hw. However, the 
electroweak penguin introduces a small (8%, 8g) component. 
The full result can be described with the four-quark AS = 1 operators, 


O; = Ha — Hpg, 04, = Ha+ Hg — He, 
Or = Ha + Hg +2Hc +2Hp, O5 = dy, + ys)a*s yul — ys)a%q, 
03 = H4+ Hg +2Hc —3Hp, Oo = dy (l + y5)s qYu (l — ys)q, 
3 _ 
07 = zall + ys)d qy" (1 — ys)Qqq, (3.31) 
3 _ 
Og = — 581% + ys5)dj giv" — ys)Qqqi, 


where q = u, d, s are summed over in O5,6,7,8, i and j are color labels, Qg is the 
charge of quark g and 


Ha = dy, (1+ ysyu ty" + ys)s, Hc = dy, (1 + ys)s dy" (1 + ys)d, 
Hg = dy, (1+ ys)sty"(U+ysu, Hp = dy, (1+ ys)s 5y" (1 + ys)s. 
(3.32) 
The operators are arranged such that O1,2,5,6 have octet and AJ = 1/2 quantum 
numbers, O3(04) are in the 27-plet with AJ = 1/2(AI = 3/2), while O7, arise 
only from the electroweak penguin diagram. The full hamiltonian is 
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Hass! = = Via be O;. (3.33) 


A renormalization-group analysis of the coefficients [BuBH 90] yields 


Cı = 1.90 — 0.62r, Cs = —0.011 — 0.079r, 
C = 0.14 — 0.020r, C = —0.001 — 0.029x, 
(3.34) 
= (4/5, C; = —0.009 — (0.010 — 0.004r)a, 


C4 = 0.49 — 0.0057, Cg = (0.002 + 0.160T)a, 


with A œ 0.2 GeV, wr œ~ 1 GeV, m, = 150 GeV, and t = —ViVis/ V% Vas- 
The number multiplying t has a dependence on m, whereas (within the leading 
logarithm approximation) the remainder does not if m, > Mwy. The t depen- 
dence in C4 arises only because of the electroweak penguin diagram. This hamilto- 
nian summarizes the QCD short-distance analysis and is the basis for estimates of 
weak amplitudes. For a treatment of corrections beyond those of leading order, see 
[BuBL 96]. 


VIlI-4 The AJ = 1/2 rule 
Phenomenology 


In the decays K — sz, the S-wave two-pion final state has a total isospin of either 
0 or 2 as a consequence of Bose symmetry. Thus, such decays can be parameterized 
(ignoring the tiny effect of CP violation) as 


: Ao . 

ô 2 iô 
AxK0_sn+7- = Ao e' 0 + T a 

id id 
Axo_,7070 = Age’ — V2 Are! 2 


Agent A, e’, (4.1) 
where the subscripts 0, 2 denote the total zz isospin and the strong interaction 
S-wave mz phase shifts ôy enter as prescribed by Watson’s theorem (cf. 
Eq. (C-3.15)). There are, in principle, two distinct J = 2 amplitudes Az and A}. 
These are equal if there are no AJ = 5/2 components in the weak transition, as is 
the case in the Standard Model if electromagnetic corrections are neglected. Includ- 
ing electromagnetism leads to a small difference between Az and A/ [CiENPP 12], 
but we will neglect this possibility from now on, and employ just the two isospin 
amplitudes Ag and A2. The experimental decay rates themselves imply 
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|Axosn+2-| = (2.772 + 0.0013) x 1077 GeV, 
|Ako 7070| = (2.592 + 0.0022) x 1077 GeV, 
|Ac+4+70| = (0.1811 + 0.0004) x 1077 GeV. (4.2a) 


The zz phase difference 69 — 52 can be obtained via 


J3 |A+_|? — |Aool? + 2A 4017/3 
cos(d9 — 62) = . 1/2? 
272 | Aol [2I A+- + lAo]? — 414 4012/3] 
o — ô2 = (44.55 + 1.04)°. (4.2b) 


This is consistent with the phase difference which emerges from the analysis of 
ma scattering. The magnitude of the isospin amplitudes can be found to be 


|Ao| = (2.711 + 0.0011) x 107” GeV, 
|A>| = (1.207 + 0.0026) x 1078 GeV. (4.3) 


The ratio of magnitudes, 
|A2/Ao| = 0.0445 + 0.0001 > 1/22.47, (4.4) 


indicates a striking dominance of the AJ = 1/2 amplitude (which contributes to 
Ag) over the AJ = 3/2 amplitude (which contributes only to A2). This enhance- 
ment of Ag over Az, together with related manifestations to be discussed later, is 
called the AJ = 1/2 rule. As we have seen in previous sections, a naive estimate 
(and even determinations which are less naive!) do not suggest this much of an 
enhancement. However, the factor 22.5 dominance of AJ = 1/2 effects over those 
with AJ = 3/2 is common to both kaon and hyperon decay.’ 

A similar enhancement of AJ = 1/2 is found in the K — zzz channel. In 
this case, it is customary to expand the transition amplitude about the center of the 
Dalitz plot. For the decay amplitude K(k) —> (pi) m(p2) m(p3), the relevant 
variables are 

2 
si = (k — p? ; s= 5 (61 + 52+ 55) = E tm, 
i=1,2,3 (4.5) 
_ Sy} — S2 z 53 — SO 
x= ——_> y= 
So So 


where s3 labels the ‘odd’ pion, i.e. the third pion in each of the final states 7*+2~ 3°, 


mn °n*, trta. The large AJ = 1/2 amplitudes are considered up to quadratic 
3 Although our discussion stresses the relative magnitudes of the AJ = 1/2, 3/2 amplitudes, the relative 


phases of these amplitudes turns out to place important restrictions on the structure of the |AS| = 1 
nonleptonic hamiltonian [GoH 75]. 
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order in these variables while the AJ = 3/2 amplitudes contain only constant plus 
linear terms, 


; 2 : 
PDA pi cient = aj = 2a3 + (by = 2b3) vel! = = by xe! 
3 


1 1 ; 
+c F ax +d FE edu 
3 3 
1 
V2 Apos ror0z0 = 3 (a1 — 2a3) + 3c G + 3) ; 


Axtontntn- = 2(a1 +43) — (bi + bs) ye! + bog vel! 


1 1 
+ 2¢ G + 5%) +d (5° =- 52) etm, 


Axtsnn0nt+ = a +43 + (bi + bs) ye! + bog pel 
1 1 , 
+c (5° + 5) +d (¥ = 5”) elem (4.6) 


where a1, bj, c, d are AI = 1/2 amplitudes, a3, b3, b23 are AJ = 3/2 amplitudes, 
and the phases {ô;} in dy; (= dy — 6,) and 52; (= 52 — 4,) refer to final-state phase 
shifts in the J = 1,2 and mixed symmetry Z = 1 states respectively. Because of 
the relatively small Q value for such decays (Qrrr = mx — 3m, ~ 75 MeV), 
such phases are presumably small and are often omitted. Also, this representation 
in terms of simple energy-independent phase factors is clearly idealistic. Analysis 
of the available data [BiBD 03] yields (in units of 1077) 


a, = 9.32 + 0.04, az = 0.34 + 0.03, 
bı = 14.2 +40.2, b, =—0.6+0.1, bz = 2.7 0.3, (4.7) 
c=-11405, d= =30 £0.8. 


Dominance of the AJ = 1/2 signal is again clear in magnitude and in slope terms, 
e.g., we find at the center of the Dalitz plot, 


|a3/a,| 1/27. (4.8) 


In SU (3) language, the dominance of AJ = 1/2 effects over AJ = 3/2 implies 
the dominance of octet transitions over those involving the 27-plet. This is a conse- 
quence, within the Standard Model, of the fact that the AJ = 1/2 27-plet operator 
contributes relative to the AJ = 3/2 27-plet operator with a fixed strength given 
by the scale-independent ratio of coefficients C3/C4 ~ 1/5 (viz. Eq. (3.34)). The 
27-plet operator then gives only a small contribution to the AJ = 1/2 amplitudes, 
with the major portion coming from the octet operators. We shall therefore ignore 
the AJ = 1/2 27-plet contribution henceforth. 
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Chiral lagrangian analysis 


The left-handed chiral property of the Standard Model may be directly tested by 
the use of chiral symmetry to relate the amplitudes in K —> zzz to those in 
K — az. We have already constructed the effective lagrangians for (8z, 1p) and 
(27L, Lr) transitions. Dropping on *) the nonleptonic decays are described by the 
two parameters gg and fe ? at O(E”). Let us see how well this parameterization 
works, and afterwards add O(E*) corrections. The two free parameters may be 
determined from Ap and Az in K — xr decays. From the chiral lagrangians of 
Eqs. (2.4), (2.8b), we find 


Viss Ti 
o= T (mk — m-) r A = a (mk — mŽ) : (4.9) 
which yields upon comparison with Eq. (4.3), 
gg X 7.8 x 10-8 F?, eo 2025 x 10°F, (4.10) 


The K — zzz amplitude may be predicted from these. Because there are only 
two factors of the energy, no quadratic terms are present in the predictions, 


(1/2) _ /2Aom 1 m% + 3m? = 
Kl>ata—n  6F (m — m2) m% slg 
(3/2) Am? 5 my + 3m2 _ 
Ao. gtn-n =: 2 2 1— 2 ; 
L 3 Fr (mkg — m2) 4 mg 
Am? m +3m2 _ 
AG?) = K f 44-8 S|, (4.1 1a) 
Ktsntnatn 3F, (m2. _ m2) m% 
which correspond to the numerical values (again in units of 1077), 
(1/2) 7 
ARO T =7.5 +9.19, 
(3/2) 5 
A ea = —0.47 + 0.74 7, 
ACP tnta- = 0.47 +2.35. (4.11b) 
These are to be compared to the experimental results, 
Av a go = 932 + 14.25 — 6.1 7 + 1.327, 
L 
A ara = —0.68 + 1.27, 
ACD ag = 0.68 + 3.35. (4.12) 


This comparison can be seen in Fig. VHI—3, where a slice across the Dalitz plot is 
given. Also shown are the extrapolations outside the physical region to the ‘soft- 
pion point’ where either pt — 0 or pj) — 0. Predictions at these locations are 
obtained by using the soft-pion theorem (see Prob. VII—2). 
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Fig. VIN-3 Dalitz plot. 


The chiral relations clearly capture the main features of the amplitude and 
demonstrate that the K — 3m AI = 1/2 enhancement is not independent of 
that observed in K —> 2x decay. However, for the AJ = 1/2 transitions we may 
do somewhat better. The kinematic dependence of x? or y? can come only from a 
chiral lagrangian with four factors of the momentum, and only two combinations 
are possible: 


Aguad = Vik + pop+ > P- + y2 (k + ky pos p- +k + p-po- p+). (4.13) 
Such behavior can be generated from a variety of chiral lagrangians, 


Laua = 8g Tr (A6d,U0"U'I,U0"U") 
+ gf Tr (A69,U0,U'I"U AU") +- . (4.14) 


However the predictions in terms of y; are unique. Fitting the quadratic terms to 
determine y1, y2 yields the full amplitude, 


V2 AL = (9.5 + 0.7) + (16.0 + 0.5) y — 4.85 y +0.88%2, (4.15) 


K9-3ntn-719 
which provides an excellent representation of the data. Final-state interaction 
effects also provide an important contribution and must be included in a complete 
analysis [KaMW 90]. Note that in the process of determining the quadratic coeffi- 
cients, the constant and linear terms have also become improved. This process can- 
not be repeated for AJ = 3/2 amplitudes due to a lack of data on quadratic terms. 


Vacuum saturation 


Direct calculations of nonleptonic amplitudes have proven very difficult to per- 
form. On the whole, no single analytical or numerical method for overcoming the 
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general problem yet exists. In the following, we describe the simplest analytical 
approach, called vacuum saturation, which often serves as a convenient bench- 
mark with which to compare the theory. For convenience we consider only O; (the 
largest AJ = 1/2 operator) and O; (the AJ = 3/2 operator), 


Hw = ERVAN CiO + C404), (4.16) 
with Cı ~ 1.9 and Cy ~ 0.5. The vacuum saturation approximation consists of 
inserting the vacuum intermediate state between the two currents in all possible 
ways, e.g., 

(x* (py) (p_) |dy" (1 + ys) uity" (1 + ys) s| K°(K)) 
= (m~ (p_) |dy"ysu| 0)(x* (p+) |#y"s| K°(K)) 
+ (1+ (p+) (p-) |ipy“ta| 0) (0 day" yssp| K°(k)) 


i 
= iV 2 Fe f+ BE + Pip — V2 Fee frku (P- = p+)". (4.17) 
In obtaining this result the Fierz rearrangement property 


day” (1+ ys) Uaitgy"” (1 + ys) 8g = day” (1 + ys) sgūgy” (1 + Ys) Ua 


has been used, where a, 6 are color indices which are summed over. In addition, 
the color singlet property of currents is employed, 


Sap 
(0 |da yuyssg| K°(k)) = iv2 Feky a (4.18) 


Within the vacuum saturation approximation, we see that the amplitudes are 
expressed in terms of known semileptonic decay matrix elements. Putting in all 
of the constants, we find that 


G 
Ao = Z Vi Vos Fx (mk — mz) Cı = 0.34 x 1077 GeV, 
IIG 
sy z E Y* VF, (mk — m2) Cy = 2.5 x 10°” GeV. (4.19) 


The above estimate of Az is seen to work reasonably well, but that of Ag falls 
considerably short of the observed AJ = 1/2 amplitude. This demonstrates that 
vacuum saturation is not a realistic approximation. However, it does serve to indi- 
cate how much additional AJ = 1/2 enhancement is required to explain the data. 


Nonleptonic lattice matrix elements 


While historically there have been many attempts to improve on the naive vacuum 
saturation method, the present state of the art is to rely on lattice calculations. 
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However, nonleptonic matrix elements have been a particular challenge for lattice 
methods. The transition from a kaon to two pions requires three external sources to 
create the mesons involved, as well as a singular four-quark operator for the weak 
hamiltonian. In addition, there are diagrams where quarks in the hamiltonian form 
disconnected loops not connected to external states. Recent advances have allowed 
the extraction of the Az amplitude with reasonable precision [B1 et al. 12] 


|A2| = (1.381 + 0.046 cat + 0.258syst) x 1078 GeV (4.20) 


consistent with the experimental result of Eq. (4.3). However, the isospin-zero final 
state in Ao implies the existence of disconnected diagrams, which make the numer- 
ical evaluation difficult, and we do not yet have a reliable lattice calculation of Ag 
[B1] et al. 11]. 


VIII-5 Rare kaon decays 


Thus far, we have discussed the dominant decay modes of the kaon. There are, how- 
ever, many additional modes which, despite tiny branching ratios, have been the 
subject of intense experimental and theoretical activity. We can divide this activity 
into three main categories. 


(1) Forbidden decays — These include tests of the flavor conservation laws of 
the Standard Model such as K; — e*u”. Positive signals would represent 
evidence for physics beyond our present theory. 

(2) Rare decays within the Standard Model — These include decays which occur 
only at one-loop order. Such processes can be viewed as tests of chiral dynam- 
ics as developed in this and preceding chapters (e.g., radiative kaon decays) 
or as particularly sensitive to short-distance effects, which probe the particle 
content of the theory. 

(3) CP-violation studies — There is now confirmation of CP-violating processes 
involving kaons and B mesons (and searches of the same for D mesons). Also, 
the observed baryon—antibaryon asymmetry of the Universe requires the exis- 
tence of CP violation within the standard cosmological model. There remain, 
however, interesting opportunities for further studies of CP violation within 
the subfield of rare kaon decays. 


Any of these have the potential to yield exciting physics. We shall content ourselves 
with discussing only a small sample of the many possibilities. Surveys of rare kaon 
modes appear in [CiIENPP 12] and also in [RPP 12]. 

Consider first the rare decay K* —> mtv, where the neutrino flavor v = 
Ve, Vu, Vr is Summed over. This mode is often called ‘K * to x+ plus nothing’, 
in reference to its unique experimental signature. This process can take place only 
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Fig. VIII-4 The decay K+ > mT veve. 


through loop diagrams, such as the ones in Fig. VII—4. We content ourselves here 
to show just the effective hamiltonian for the dominant t-quark loop at leading 
order in OCD, [InL 81, HaL 89] 


Gr a 


en ae 
= J2 2r sin? 4, 


VieViaXo(xr) D> dey + ys)ve Syu + ys)d, (5.1) 
Z 


where x, = m?/m4, and 


Xt Xt + 2 3x; as 6 
X = : 5.2 
ox) 8 (= Z] + a 5) (5.2) 


The overall factor of m? / my in Xo(x;) is associated with the GIM effect; the 
c-quark and u-quark amplitudes, were they included, would contain analogous 
mass factors such that in the limit m, = me = m, the total amplitude would 
vanish via GIM cancelation. Although the calculation of many hadronic processes 
in this book are hindered by QCD uncertainties, such is not the case here. The 
quark matrix element is related by isospin to the known charged current amplitude 


(r+ (p) |Sy,d| K*(k)) = V2 (x° (p) |5y,u| K*(K)) = f(g) (k+ p), 6-3) 


with f,(0) = —1. This makes the K* — zt vv, example a theoretically ‘clean’ 
process and is responsible in large part for all the attention this transition has 
attracted. Our discussion is relatively brief, and a more careful analysis would 
include effects like QCD perturbative corrections, the c-quark loop contribution 
(roughly 30%), etc. A recent prediction, [BrGS 11], along with the current experi- 
mental result [RPP 12], reads 


Brite). = (7.8 £0.8) x 10-4, BrP = (1.71.1) x 107”. 


Ktrt wig Ktomtveig 
(5.4) 


The experimental result, still consistent with zero, is reaching the sensitivity needed 
to probe the theory prediction and further advances are anticipated. The main 
source of uncertainty in the theory prediction is from CKM factors and quark mass 
values, which presumably can and will be improved upon. 
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Fig. VIII-5 Long-distance contributions to radiative kaon decays. 


A different class of rare decays consists of the radiative processes Ks —> yy 
and K; — w°yy. These transitions provide interesting tests of chiral perturbation 
theory at one-loop order. In this case, the long-distance process, Fig. VIII-5, is 
dominant. An important feature is that there is no tree-level contribution at order 
E? or E* from any of the strong or weak chiral lagrangians because all of the 
hadrons involved are neutral. Thus, the decays can only come from loop diagrams, 
or from lagrangians at O(E°). There is also an interesting corollary of this result 
concerning the renormalization behavior of the loops. Since there are no tree-level 
counterterms at O(E*) with which to absorb divergences from the loop diagrams, 
and recalling that we have proven all divergences can be handled in this fashion, it 
follows that the sum of the loop diagrams must be finite. This is in fact borne out 
by direct calculation. 

For Ks — yy, the prediction of chiral [D’ AE 86, Go 86] loops is given in terms 


of known quantities such as 
2 
Mk 
r (Zi 
T 


amk gF2 ( m2)’ 
167? 

F(z) = 1 = z [7° — In? O(z) — 27i In Q(2)], 

1—41 -— 4z 


ttyl 


where gg is the nonleptonic coupling defined previously in Eq. (2.4). Comparison 
of the theoretical one-loop branching ratio and the experimental result, 


2 


Pkssyy — 


? 


Q(z) = (5.5) 


Br =2.0x 107%,  BrẸ™ = (2.63 +0.17) x 10°, (5.6) 


Ks>yy Ks>yy 


shows reasonable agreement but implies the need to consider O(E°) corrections. 
In particular, the ‘unitarity correction’ Ks —> mta — yy has been shown to 
provide an improved theoretical prediction [KaH 94]. 

The case of K; —> z°yy is also instructive. Again, one-loop contributions are 
finite and unambiguous [EcPR 88]. Indeed, we know that K; > m°yy and K; > 
yy are related by the soft-pion theorem in the limit pt — 0, yielding 
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A eandyy _ FE 
ee ee ae Z, — 
dz (dr) m2, 


2 


m; mk m; 
x=- Fie- )t Le F(z)| , (5.7) 
Mk mM; Mk 
where z = m? „/m% and 
Ma b) = 1 +a? +b? — 2(a +b + ab). (5.8) 


If we compare the theoretical branching ratio based on the above description with 
the experimental value, we find 


= (1.273 + 0.033) x 10°°. (5.9) 


SW yy 


(loop) _ —6 
Brk, oe 0.68 x 107”, Brpo 


This indicates the need for an O(E°) correction. Indeed, the most recent data input, 
from [Ab et al. (KTeV collab.) 08], provides evidence for a vector exchange contri- 
bution. It is easy to take this into account, viz. the diagram of Fig. VHI-—5(c) shows 
the effect of p-exchange. 


Problems 


(1) Kaz decay 
The ratio f its +x! OVF Pa) of semileptonic form factors is a measure of 
isospin violation. Part of this quantity arises from 19-78 mixing. 
(a) By diagonalizing the pseudoscalar meson mass matrix, show that ma Æ Mu 
induces the mixing |z?) = cos € |y3) +sin € |v) where € ~ V3(mq —m,)/ 
[4(m, — m)] and m = (m, + mg)/2. 
(b) Demonstrate that this leads to the result (cf. Eq. (1.11)) 


FE" 0) cosefk'” (0) + sinefX * (0) 
fF" 0) hae) 


ans 1+V3sine. 


(2) Soft pions and K — 3x decay 
The results derived in Sect. VII—4 with effective lagrangians can also be 
obtained by means of soft pion methods (see App. B-3). 
(a) Using the soft pion theorem, show that the soft-pion limit of the K — 3x 
transition amplitude is given by 
lim (re mgg Hov OK) = Farell OS, Hw (OK), 


where Qf = fx Ag (x, t) is the axial charge. 
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(b) Demonstrate that this may be also written as 


—i 
lim (0g, 9, gel HOKE) = F gTa lO, How OMUKE), 


where Q* is an isotopic spin operator, and hence that 


z of 30) I 0 
erat Tg lHa (O)| Ky) = 2F Fp Meat 
fi + _— _0 H! 0 K? Z A! — A! 
cael w0)|K,) = =a K9-+ 7979 Kortn)? 
+> 
I ny __ 
HE a, q+ Tq- pape (0)| Ko )= FA bo tne 
— A! : A! 


Kota + 5 Caqigih 


where J = 1/2, 3/2 signifies the isospin component of the quantities in 
question. 

(c) Use a linear expansion of the K — 3z transition amplitude, (i.e. Eq. (4.6) 
with c = d = 0) to reproduce the results of Eq. (4.11), up to corrections of 
order m2. 


IX 


Mass mixing and CP violation 


Aside from a concluding section on the strong CP problem, this chapter is about 
the CP violation of kaons. We set up the general framework for meson—antimeson 
mixing, which is also used in the weak interactions of heavy quarks, treated later 
in Chap. XIV. In this chapter we apply the formalism to K°-K° mixing and CP- 
violating processes involving kaons. 


IX-1 K°-K° mixing 


It is clear that K? and K? should mix with each other. In addition to less obvious 
mechanisms discussed later, the most easily seen source of mixing occurs through 
their common mz decays, i.e., K’ <> am <> K°. We can use second-order per- 
turbation theory to study the phenomenon of mixing. Writing the wavefunctions in 
two-component form 


t - 
Iy) = (o ') = a(t)|K°) +b |K’), (1.1) 
(t) 


we have the time development 


d E i 
Ve (m = Sr) IVO), (1.2) 


where, to second order in perturbation theory, the quantity in parentheses is called 
the mass matrix and is given by! 


l The factors 1 /2m x are required by the normalization convention of Eq. (C-3.7) for state vectors. 
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[u 7 r] _ (K} Herr K?) 
ij 


2mK 


(K? [Hwyl Kj) 1 

(0) 4 J 

= Ôij 

Mk St T 2mK = 2nK 2, 


n 


(K? Halan) (n|Hw|K}) 


m® — E, + ie 


(1.3) 
Here, the absorptive piece I arises from use of the identity 
1 
————— =P — in d(E, — w), 1.4 
w— E, +1€ (=) di e Lie 
and hence involves only physical intermediate states 
1 
Tiy = — ) (K? Hla) (n| HwlK?)2m8 (En — mx). (1.5) 


` 2mxK z 


Because M and I are hermitian, we have M2; = Mj, and P2; = r{,. The diagonal 
elements of the mass matrix are required to be equal by CPT invariance, leading to 


a general form 
i A p 
M--T= ; (1.6) 
7 2 
q A 


where A, p°, and q? can be complex. The states K? and K° are related by the 
unitary CP operation, 


CP|K°) = &x|K°) (1.7) 


with |&x |? = 1. Our convention will be to choose £g = — 1. The assumption of CP 
invariance would relate the off-diagonal elements in the mass matrix so as to imply 


P=4q;, 
(K°|Hepe|K°) = (K° | (CP)! CP Hett (CP)~'CP| K’) = (K° (Herl K°), (1.8) 


where (K° [Hetl K?) is defined in Eq. (1.3). Combined with the hermiticity of M 
and I’, this would imply that Mj and Ij are real. In the actual CP-noninvariant 
world, this is not the case and we have instead for the eigenstates of the mass 
matrix, 
1 _ 
|K.) = —=——— [p|K*) + q|K)], (1.9) 


‘vip? + lal? 


where, from the above discussion, 


p Mn — iln i se 
= ,|————,, Mn- Tp =(K° |H] K°). (1.10) 
q Mh — Vp 2 
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The difference in eigenvalues is given by 
i 
2qp = (m; — ms) — rae = Is) 
i 1/2 i 1/2 
=2( Ma Tn) (iz - Ti) x~ 2Re Mj. —iRe Fy, (1.11) 


where the final relation is an approximation valid if CP violation is small (1 > 
Im Mı2/Re Mj2). The subscripts in Kz and Ks, standing for ‘long’ and ‘short’, 
refer to their respective lifetimes, whose ratio is substantial, tz/ts œ~ 571. To 
understand this large difference, we note that if CP were conserved (p = q), these 
states would become CP eigenstates K? (not to be confused with the charged kaons 
K=; 


|Ks) = |K°), |K) pea |K°), 
IK?) = z [IK°) =1K%)],  CPIK}) = K$). 


(1.12) 


In this limit, which well approximates reality, Ks would decay only to CP-even 
final states like 22, whereas K; would decay only to CP-odd final states, e.g., 37. 
Since the phase space for the former considerably exceeds that of the latter at the 
rather low energy of the kaon mass, Ks has much the shorter lifetime. The states 
Ks,L, expanded in terms of CP eigenstates, are 


sl [jx +ajxgy], 2 = LE 
|Ku) = aae a q _ 127 


p-q _ i IMMy-ilmMlp/2 _ 1 Mn- Ma — $n — Tn) 
ptq 2 ReMy—iReT2/2~ 2 m,—ms — (T, -Ts) 


E= (1:13) 


K?-K?° mixing can be observed experimentally from the time development of a 
state which is produced via a strong interaction process, and therefore starts out at 
t =0 as either a pure K? or K®, 


[KCt)) = g4(O1K°) + pe OR), 


[KCr)) = 781K") + 94(t)|K°), 


1 : : 
g(t) = > as [1 + g Eee am] , (1.14) 


where AT = I's -—Tz;z and Am = m; — ms, each defined to be a positive quantity. 
From such experiments, the very precise value 


AMexpt = (3.484 + 0.006) x 107!? MeV (1.15) 


has been obtained. 
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Fig. IX-1 Box (a),(b) and other contributions to CP violation. 


CP-conserving mixing 
There are two main classes of contributions, associated respectively with the short- 
distance box diagrams of Fig. IX-1(a),(b) and the long-distance contributions like 
those in Fig. IX-2, 
Amtheory = (Am)ao iy + (Am) Re (1.16a) 


theory theory * 


We shall consider the first of these here, the short distance component 


(Am) heory = 2Re(K° |[Hy*| K (1.16b) 
Determining (Am)ixeory has long been, and continues to be, a significant topic in 
kaon physics. It involves almost all the field theory tools we describe in this book. 
Our discussion will of necessity include some advanced features in order to present 
a realistic picture of the current state of the art. 

The construction of H?%* follows a standard procedure: to a given order of QCD 
perturbation theory, first specify the Wilson coefficient at the scale u = My, then 
use the renormalization group (RG) to evolve down to a hadronic scale u < me 
and finally match onto the effective three-quark (i.e. u, d, s) theory. The result of 
this is 

T= Cw 0^, (1.17) 
OAS=2 


where is the local four-quark operator 


ors = dy (1 + ys)s dy"(1 + ys)s, (1.18) 


and C (u) is the corresponding Wilson coefficient, 


wnt RO ep 
a 


(a) (b) 


Fig. IX-2 Long-distance contributions to K? — K? mixing. 
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2 
C(u) = fs [Hamn F E? H (xM? Nu T 28.&,G (Xe, smn [OU , 


167? 
(1.19) 


with & = Vň Vis (i =c, t) and x; =m?/Mj,. The above expression for C(u) is 
more complicated than the C+(u) encountered in our earlier AS =1 discussion 
(cf. Eq. (VUI-3.11)) because the box amplitude for AS =2 has loop contribu- 
tions from all the u, c, t quarks. Actually, Eq. (1.19) has already been simplified 
in that CKM unitarity has allowed removal of &, and the tiny mass of the u quark 
has been neglected with respect to the heavy-quark masses me, m;. The quanti- 
ties H(x,), H(x-) and G(x., x;) in Eq. (1.19) are so-called Inami—Lim functions 
[InL 81] that describe the quark-level loop amplitudes of Fig. [IX—1(a),(b) in the 
no-QCD limit, 


Hosta a -e 
— = = n 
"=g fig 2 0-a] oa 


ex ee eee 
= y| — — x 
MATAH paged aer 4 =x? 


3 1 
TO Taa 20 


This leaves in Eq. (1.19) the factors Nec, Nit, Ner, and b(u). These arise from 
calculating perturbative corrections to H?*.? Such corrections will contain depen- 
dence on both the scale (u) and renormalization scheme (say, the NDR approach, 
described in App. C-5). These cannot be present in the full amplitude and must be 
cancelled by analogous dependence in the matrix element (K° |O“S=?| K°). For 
convenience, the scale and scheme dependence present in the Wilson coefficient 
C(u) is placed into the factor b(jz), which for K oo a mixing has the perturbative 
form 


CO 
= n æs(u)" = as (u) 
ose A a i TAW fre ge], 
n=0 a T 


(1.21) 


Scheme dependence first appears in J‘) via the anomalous dimension y ® of oper- 
ator 0^5=2, 


2 Itis customary to classify corrections according to the order of QCD perturbation theory used to determine 
them, e.g. leading’ (LO), ’next-to-leading’ (NLO), ’next-to-next-to leading’ (NNLO) and so on. It is 
disturbing that nNNLO ~ 1.87 is about 36% larger than 7nNLo ~ 1.38. This is an unexpectedly large result, 
one which warrants further study. 
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yO BY y B 153 — 19n; 1 4np-—63 
a 280 T "2° G3 mpi 6 33-an 


pr (1.22) 
shown here for n flavors and in NDR renormalization. The above expression for 
b(n) serves at the same time to define the perturbative factors Nec, Nr, and ner. For 
completeness, we display the most recent determinations [BrG 12] of the {7;} (with 
perturbative order shown as well), 


nAN O = 1.8706) nN = 0.5765(65) nO = 0.496(47). (1.23) 


The determination of (K° | gss =2(w)| K°) at a hadronic scale jz < me involves 
nonperturbative physics, so its evaluation by analytical means is problematic. It has 
become standard to express this quantity relative to its vacuum saturation value and 
introduce a parameter Bx as 


= = 16 
(K° ON (| K?) = 3 Femk Bru), (1.24) 


with Fx = 110.4+0.6 MeV.’ There has been substantial progress in the calculation 
of nonperturbative quantities such as Bx using lattice QCD methods. The scale- 
and scheme-independent version is defined as 


Br = b(w)Br(u) (1.25) 
and the value used in [BrG 12] is 
By = 0.737 + 0.020. (1.26) 


Finally, given present values for the CKM elements and the t-quark mass, the 
most important contribution to the real part of H°°* is found to be from the c 
quark. In view of this, and noting that H (xe) ~ 1 (cf. Eq. (1.20)), we then have 


G2 
Re Fo" ~ n Re (Vii Ves) 


2 


i nO (1.27) 


At this point, we have the ingredients for determining (Am) heory and one obtains 
[BrG 12] 


(Am) y = (3.1 £ 1.2) x 10715 GeV, (1.28) 


which is consistent with the value cited for Amexp in Eq. (1.15) within the quoted 
uncertainty. 


3 The reader should be wary of occasional notational confusion between Fx and fx = J2FK. 
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(b) 
Fig. IX-3 Mechanisms for CP violation. 


IX-2 The phenomenology of kaon CP violation 


The zz final state of kaon decay is even under CP provided the strong interactions 
are invariant under this symmetry. For the mlm? system, this is clear since 7° is 
itself a CP eigenstate, CP|2°) = — |7°), and the two pions must be in an S-wave 
(£ =0) state, 


CP\r°n®) = (—1)?(—1)$|2°n®) = +r’). (2.1) 


The corresponding result for charged pions reflects the fact that z* and x7 are CP- 
conjugate partners, CP|7~) = — |x}. We have seen that if CP were conserved, 


the two neutral kaons would organize themselves into CP eigenstates, with only 
Ks decaying into zz. Alternatively, Kz; decays primarily into the mx final state, 
which is CP-odd if the pions are in relative S waves. The observation of both 
neutral kaons decaying into zz is then a signal of CP violation. 

There can be two sources of CP violation in K; — mz decay. We have already 
seen that K°—K° mixing can generate a mixture of the CP eigenstates in phys- 
ical kaons due to CP violation in the mass matrix. There also exists the possi- 
bility of direct CP violation in the weak decay amplitude, such that the CP-odd 
kaon eigenstate |K?) makes a transition to 27. These two mechanisms are pic- 
tured in Fig. IX-3. The Kara decay amplitudes have already been written down 
in Eq. (VII-4.1) in terms of real-valued moduli Ag, A2, and pion—pion scatter- 
ing phases ôo, 52. This decomposition is a consequence of Watson’s theorem, and 
relies in part upon the assumption of time-reversal invariance. However, if direct 
CP violation occurs, Ag and Az can themselves become complex-valued, 


Ao = |Aole®, = Az = |Aale”®?, (2.2) 


with CP violation in the decay amplitude being characterized by the phases éo and 
&. Consequently, the Ko — ma and Ko — saz decay amplitudes assume the 
modified form 


if) i Aol je ; 
Arts = |Aolee!> + a ete’, 


TN” A io 
Ajzo_.ntn- = —|Aole el? — ál its ida (2.3) 
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Using the definitions of Kz; and Ks in Eq. (1.13), a straightforward calculation 
leads to the following measures of CP violation: 


ern [elke (0° \HwlKr) 


=n- Hert, = =e-—2e', (2.4 
(atn [Hal Ks) t (xn HK) oe 
where 
e€ = € + i, 
+ „i (82—80) ; pi (82—80) 
le A2 le A2 Im A2 Im Ao 
€=—— |= | (2 -—) = = - (2.5) 
V2 Ao y2 Ao Re A> Re Ao 


The expression for e can be simplified by approximating the numerical value 
Am/AY =0.475 + 0.001 by Am/AT ~ 1/2. This yields the approximate relation, 


i eit/4 1 


: ~ , (2.6) 
Am + ZAT v2 Am 


which we shall use repeatedly in the analysis to follow. In addition, since the rate 
for K — mz is much larger than that for K > maz, and K? —> mm is in turn 
dominated by the 7 = 0 final state because of the AZ = 1/2 rule, we have 


Im Tj. > Eor s = 2& Am. (2.7) 
The above relations allow us to write 
z+ ik et (Im Mi iy) +ik 
e=é€+i& x —i i 
0 Va hn 0 0 


et (Im Mp et /ImMp  ImAo 
= + éo | = E , 
Am J2 2Re Miz Re Ao 


= 12 giog, siege: iwe!~*) (Im Az Im Ao 
V2 J2 Re A2 Re Ao , 


where w = Re A2/Re Ag œ 1/22. All CP-violating observables must involve an 
interference of two amplitudes. In Eq. (2.8), the quantity € expresses the interfer- 
ence of K? —> az with K’? > K? — xr, while e’ involves interference of the 
I =O and / =2 final states. 

The formulae for € and €’ exhibit an important theoretical property. Since the 
choice of phase convention for any meson M is arbitrary, its state vector may be 
modified by the global strangeness transformation |M) — e'*5|M). For the K?’ 
and K? states, this becomes 


(2.8) 


|K?) = eR, [A —> e] K’), (2.9) 


which has the effect, 
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Im A; Im A; Im Mi? Im Mi? 
=> + ; > a 
Re Al Re Ar Re Miz Re Miz 


2. (2.10) 


We see that the values of € and €’ are left unchanged. Various phase conventions 
appear in the literature. In the Wu—Yang convention, à is chosen so that the Ag 
amplitude is real-valued. This is always possible to achieve by properly choosing 
the phase of the kaon state. However, it is inconvenient for the Standard Model, 
where the Ao amplitude naturally picks up a CP-violating phase. We shall therefore 
employ the convention in which no such additional phases occur in the definitions 
of the kaon states. 

It was in the K — zz system that CP violation was first observed. The current 
status of measurements is 


je] = (2.228 £ 0.011) x 107°, 
Re (£) = (1.66 + 0.26) x 1073, 
E 


g+- = phase(ņn+-) = (43.51 + 0.05)° 
Poo = phase(noo) = (43.52 + 0.05)°. (2.11) 


A violation of CP symmetry has also been observed in the semileptonic decays 
of Kz and Ks. These are related to matrix elements of the weak hadronic currents. 
Since K? must always decay into e+ v.2~ while K° goes to e7 pert, we have 
= AK0_,2-et ve ’ 
l-—e 
AK, —>nte-ig = Aa A ree er ae 


AK, —>n-et+v, 
(2.12) 


If the semileptonic decays proceed as in the Standard Model, there is no direct CP 
violation in the transition amplitude, so that 


I ki>r-etve _ 1 +2Re ë 
Vx, sonte-a,  1—2Reē 


~1+4Ree. (2.13) 


Since Re €=Re e, the above asymmetry is sensitive to the same parameter as 
appears in the Kz — sz studies. Here, measurement gives 


Re € = (1.596 + 0.013) x 1073 = |e|cos(44.3 + 0.8)°, (2.14) 


which is consistent with the experimental values from K —> mz. 

Finally, precision experiments also probe the CPT transformation. For example, 
two such predictions, involving kaon masses (m go =m go) and phases (%+4_ = oo 
up to very small corrections from €’), are seen to be consistent with existing data, 
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Fig. IX—4 (a) Penguin and (b) electroweak-penguin contributions to CP violation 
in AS = | transitions. 


|mgo — m gol 


<8 x 107!, 
mM go 
Poo — g4- = (0.2 Æ 0.4)°. (2.15) 


Further study of CPT invariance is left to Prob. IX-2. 
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After diagonalization, there can remain a single phase in the CKM matrix. This 
phase generates the imaginary parts of amplitudes, which are required for CP vio- 
lation. It is a physical requirement that results be invariant under rephasing of the 
quark fields. As a consequence, all observables must be proportional to 


Im A® = A°AîN = C1C2C387 828385, (3.1) 


written in the notation of Sect. II-4. This shows that all CP-violating signals must 
vanish if any of the CKM angles vanish. We shall now study the path whereby this 
phase is transferred from the lagrangian to experimental observables. For kaons, 
we have seen that the relevant amplitudes are those for K°—K° mixing (AS = 2) 
and for K — ma decays (AS = 1). Tree-level amplitudes in kaon decay can never 
be sensitive to the full rephasing invariant, so that one must consider loops. Typical 
diagrams are displayed in Fig. X-4. 

Experiment can help in simplifying the theoretical analysis. Note that €’ is sensi- 
tive to AS = 1 physics through the penguin diagram [GiW 79], while € is sensitive 
to AS = 2 mass-matrix physics as well as to AS = 1 effects. However, since exper- 
iment tells us that |e| >> |e’|, it follows that the AS = 1 contributions to € must be 
small. Likewise, the long-distance contributions of Fig. IX-2 and the contribution 
of Fig. [X—1(d) must both be small because each also involves the AS = 1 interac- 
tion. This leaves the box diagrams of Fig. IX—1(a),(b) as the dominant component 
of e. Moreover, since the CKM phase ô is associated with the heavy-quark cou- 
plings, only the heavy-quark parts of the box diagrams are needed. Hence € is very 
clearly short-distance dominated. 
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Analysis of |e| 


The evaluation of e follows directly from Eq. (2.8). To begin, we shall ignore the 
tiny Im Ag/Re Ay ~ O(10~>) dependence therein.* This leaves us with the issue 
of calculating Im M,>. From the discussion of the ‘box’ hamiltonian 15% given in 
Sect. IX—1, we have 


G$ a - 
Im My = a KMK BRA RA 


x [Mem H (xe) = num; H&A — P) = NeameG (xe, %)]. 8-2) 
Some CKM-related relations and definitions useful in obtaining the above form are 


i i 
Reg =-2(1- 5), Reg =-a(1- 5) at A), 


Imé, = —Imé, = —nA7A°" 


Ae ae 
p=e(i-3). =ni- 
From Eq. (2.8) and Eq. (3.2), we obtain the Standard Model prediction, 
G} Fimg Bg ASH 
3/202 Amkg 
x [nem — Nam? H (x) A40 — p)— Nerm2G (xe, x:)| i (3.3) 


lelsm = 


roughly in accord with the experimental value, given the uncertainties in several of 
the above factors, when lattice determinations of the Bg parameter are used. 


Analysis of |e’ | 


The importance of €’ lies in the fact that it proves that CP violation also occurs 
in the direct AS = 1 weak transition, which is a hallmark of the Standard Model’s 
pattern of CP breaking. For this process, the CP-violating phases from the CKM 
elements can occur only in loop diagrams, and these appear in the penguin diagram 
and in the electroweak penguin process in which the gluon is replaced by a photon 
or a Z? boson, as shown in Fig. IX—4. At first sight, it appears surprising that the 
electroweak penguin plays any significant role, as it is suppressed by a power of a 
compared to the gluonic penguin. However, recall from Eq. (2.8), that «’ measures 
the relative phase difference of the K — mz amplitudes Ag and A2 


Im A> Im Ao Im Ad Im Ao 
Re A2 Re Ao Re A2 Re Ao 


V2 


4 Besides, the combination Im Ao/Re Ao also contributes to €’ (as seen in Eq. (2.8)) and since |e’| < |e|, the 
contribution of this ratio to € is presumably ignorable. 


le'l] = ~ 0.032 (3.4) 
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The gluonic penguin only contributes an imaginary part to Ag because its effect 
is purely AJ = 1/2. The electroweak penguin involves an extra factor of the elec- 
tric charge Q = 513 + I5 which means that the corresponding operator has 
both AZ =1/2, 3/2 components and can contribute an imaginary part to Ap. 
Because the real part of Az is much smaller than that of Ao, by a factor of œ = 
ReA2/ReAo ~ 1/22, the effect of the electroweak penguin is enhanced by the 
small denominator. However, while both diagrams make important contributions, 
it does appear that the gluonic penguin is the larger effect. 

The ingredients to €’ can be expressed numerically [CiIFMRS 95] at the scale 
u =2 GeV in the MS-NDR scheme as 


i Im (V4 Vis 
© 2.1073 A (Gem) a9 |OelK bo cev(l — Op) 
€ 1.3 x 10-3 


— 0.5 GeV (( 70) z2 |Og|K”)2 gey — 0.06] . (3.5) 


Here, we see the primary dependence of the gluonic penquin effect in the matrix 
elements of the penguin operator O6, while the electroweak-penguin (EWP here- 
after) operator is Og. These operators refer back to the decomposition of Eq. (VIII- 
3.31). The factor Ny describes isospin breaking. 

As we mentioned in Sect. VIHI-4, present lattice methods are able to calcu- 
late the Ay amplitude with reasonable precision, while the isospin-zero final-state 
amplitude Ag remains uncalculable. This means that the EWP contribution can be 
obtained, with the result [B1 et al. 12] 


Re (£) = ~(6.25 £0.44 ESE E (3.6) 
€ / EWP 


which has the opposite sign from the experimental result and is about one third 
the magnitude. A chiral analysis that we will describe shortly agrees with this. 
This implies that the phase due to the gluonic penguin Im Aọo/Re Ao must be the 
larger effect and must have the same sign as the experimental determination. This 
seems reasonable in estimates which have been made, as discussed in [CIENPP 12]. 
However, it means that we do not yet have a precise prediction for €’ within the 
Standard Model. 


Chiral analysis of (€'/€)zwp 


The chiral symmetry approach to low-energy hadron dynamics emphasized earlier 
in this book can be used to analyze the electroweak-penguin contribution to e’/e in 
the chiral limit. 
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2 [1 1 
ast (1) ;=2|Os|K”) = -z0 [500 F 5 012410). 


1 
= -g0 Oll, (3.7a) 
where Qs is the four-quark operator 
z a3 = a3 = a3 = a 3 
Qs = qy" TIIVA 5d — IV YSA DIIVA 4, (3.7b) 


and, for notational simplicity, we have suppressed dependence on a second four- 
quark operator (0|Q;|0),, since (0|Qs|0),, >> (012110), [CiDGM 01].> This rela- 
tion can be found either by constructing effective lagrangians or by use of the 
soft-pion theorem of App. B-3. 

Thus, a chiral estimate of the EWP part of «’/e amounts to determining the 
vacuum matrix element of Qg. It turns out that such information is obtainable from 
the large Q? behavior of V — A correlators measured in t decay (cf. Sect. V-3), 


AT1(Q*) = (My,3 — 14,3) (Q”). (3.8) 


The operator-product expansion (OPE) reveals that ATI (Q?) obeys the asymptotic 
behavior 


1 2 
ATI(Q”) ~ 0 [asw + b6(u) In 2] +0(Q°), (3.9a) 


where, from a two-loop study [CiDGM 01], we have 
25 
do(M) = 2 Olas Qsl0). + Flas Q10), +++, 
bs(u) = — (Olas Qsl0)u ++, (3.9b) 


where the ellipses represent higher-order terms in the OPE. Thus, the needed infor- 
mation (i.e. (0|Q|0),,) is contained in the large energy component of ATI (Q7), but 
how can we access it? This problem has been solved in two different papers, which 
use two alternative approaches. In the first of these [CiDGM 01], one employs sum 
rules like 


w 


01910), = ds s? 
(0| Qs|0) ,, Í a 


PE Ap(s)+-, (3.10) 
where the ellipses denote contributions from d > 6 terms in the OPE. This approach 
yields a determination [e / TA = (—22 + 7) x 1074, having a 32% uncertainty. 
The superscript indicates working in the chiral limit of massless u, d, s quarks. A 
second method [CiGM 03], which analyzes tau decay spectral functions by using a 


finite-energy sum rule (FESR), leads to Eases = (—15.0 + 2.7) x 1074, having 


5 The effect of (012110), is, of course, included in the full analysis of [CiDGM 01]. 
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an 18% percent uncertainty. Upon including chiral corrections, the physical result 
[e Te lees = (—11.0 + 3.6) x 1074 is obtained. Together with the lattice evalua- 
tion quoted in Eq. (3.6), these evaluations firmly imply that [e /e ewe < 0 and 
that the QCD penguin effect must be large and positive in order to reproduce the 


experimental value for €'/€ of Eq. (2.11). 


IX-4 The strong CP problem 


The possibility of a 0 term in the QCD lagrangian raises potential problems (see 
Sect. III-5). For 0 4 0, QCD will in general violate parity and, even worse, time- 
reversal invariance. The strength of T violation (and hence, by the CPT theorem, 
CP violation) is known to be small, even by the standards of the weak interaction. 
This knowledge comes from both the observed K; — 2x decay and bounds on 
electric dipole moments. From these it becomes clear that QCD must be T invari- 
ant to a very high degree. However, there is nothing within the Standard Model 
which would force the 0 parameter to be small; indeed, it is a free parameter lying 
in the range 0 < 0 < 2x. The puzzle of why 0 ~ 0 in Nature is called the strong 
CP problem. 

One is tempted to resolve the issue with an easy remedy first. If QCD were the 
only ingredient in our theory, we could remove the strong CP problem by impos- 
ing an additional discrete symmetry on the QCD lagrangian, the discrete symmetry 
being CP itself. This wouldn’t really explain anything but would at least reduce a 
continuous problem to a discrete choice. In reality, this will not work for the full 
Standard Model since, as we have seen, the electroweak sector inherently violates 
CP. It would thus be improper to impose CP invariance upon the full lagrangian. 
Moreover, even if one could set Oare = 0 in OCD, electroweak radiative corrections 
would generate a nonzero value. These turn out to occur only at high orders of per- 
turbation theory, and are expected to be divergent by power-counting arguments, 
although they have not been explicitly calculated. This divergence is not a funda- 
mental problem because one could simply absorb Obare plus the divergence into a 
definition of a renormalized parameter Oen, which could be inferred from experi- 
ment. However, we are then back to an arbitrary value of O;e, and to the problem of 
why Oren is small. 


The parameter 0 


The situation is actually worse than this in the full Standard Model, as the quark 
mass matrix can itself shift the value of 6 by an unknown amount. Recall that 
CP violation in the Standard Model arises from the Yukawa couplings between 
the Higgs doublet and the fermions. When the Higgs field picks up a vacuum 
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expectation value, these couplings produce mass matrices for the quarks, which 
are neither diagonal nor CP-invariant. The mass matrices are diagonalized by sep- 
arate left-handed and right-handed transformations, and CP violation is shifted to 
the weak mixing matrix. However, because different left-handed and right-handed 
rotations are generally required, one encounters an axial U(1) rotation in this 
transformation to the quark mass eigenstates and, as discussed in Sect. III—5, this 
produces a shift in the value of 6. Let us determine the magnitude of this shift. 
Denoting by primes the original quark basis, one has the transformation to mass 
eigenstates given by (cf. Eqs. II—4.5,4.6) 


m = Sim’Sp, vi = Sit, Wr = Siw. (4.1) 


Here, we have combined the u and d mass matrices into a single mass matrix. 
Expressing Sz,r as products of U (1) and SU(N) factors, 


Sr = eL S, SR = eR SpR, (4.2) 


with Sz, Sr in SU(N), one obtains an axial U (1) transformation angle of gr — 
L. From the discussion of Sect. III-5, this is seen to lead to a change in the 0 
parameter, 


0 > 0 =0 +2N; (PL — gr), (4.3) 


where N  =6 for the three-generation Standard Model. However, noting that the 
final mass matrix m is purely real, we have 


arg(detm) = 0 = arg (det Si detm’ det Sr) 


= arg (det si) + arg (det m’) + arg (det Sp) 
= 2N (Qr — YL) + arg (det m’) , (4.4) 


where we have used the SU(N) property, det Sp = det S; = 1. The resultant 6 
parameter is then 


6 = 0 + arg (detm’) , (4.5) 


with m’ being the original nondiagonal mass matrix. The real strong CP problem 
is to understand why 6 is small. 

One possible solution to the strong CP problem occurs if one of the quark masses 
vanishes. In this case, the ability to shift 0 by an axial transformation would allow 
one to remove the effect of 0 by performing an axial phase transformation on the 
massless quark. Equivalently stated, any effect of © must vanish if any quark mass 
vanishes. Unfortunately, phenomenology does not favor this solution. The u quark 
is the lightest, but a value m, Æ O is favored. 
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Connections with the neutron electric dipole moment 


The 6 term is not the source of the observed CP violation in K decays. This can 
be seen because it occurs in a AS =O operator, and while this may ultimately 
generate effects in AS = 1 processes, its influence is stronger in the AS = 0 sector. 
In particular, it generates an electric dipole moment d, for the neutron. Since no 
such dipole moment has been detected, one can obtain a bound on the magnitude 
of 0. 

To determine the effect of 0, it is most convenient to use a chiral rotation to shift 
the 6 dependence back into the quark mass matrix. A small axial transformation 
produces the modified mass matrix 


Lms =Y ma y +inpTysy = YL Myr + YrM' Yi, (4.6) 


Ms 


where 7 is a small parameter proportional to 0 having units of mass, and T is 
a 3 x 3 hermitian matrix. Consistency requires T to be proportional to the unit 
matrix. If this were not the case, and instead we wrote T = 1+ A,7;/2, the effective 
lagrangian would start out with a term linear in the meson fields, 


Leg ~ in Tr (TUt — UT") =2 (Damo + Tans +++), (4.7) 


rather than the usual quadratic dependence. The vacuum would then be unstable 
because it could lower its energy by producing nonzero values of, say, the zo 
field. Thus, to incorporate 6-dependence without disturbing vacuum stability, one 
chooses T = 1. The act of rotating away any dependence on @ produces a nonzero 
value of arg(det M ), and also determines n, 


6 = arg(det M) = arg [(m, + in) (ma + in) (ms +in)], 
M,MqMs 


xo f lln), 4.8 
4 M,Mqg + MMs +mgm, (ror sinall) (4.8) 
such that the mass terms become 
Lass = m,uu + madd + mMm,sSsS 
= MqMqMs _ = _ 
+16 (ūysu + dysd + Sys) i (4.9) 


my,Mq ++ myMs F MqMs 


The last term is the CP-violating operator of the QCD sector. Note that, as 
expected, 0 vanishes if any quark is massless. 
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A nonzero neutron electric dipole moment d, requires both the action of the 
above CP-odd operator and that of the electromagnetic current, 


1 
=al v — , CP-odd em 
deit(p' ong" ysu (P) = 2 ap) e T |ye"|n(p)), (4.10) 
where q = p’ — p and we have inserted a complete set of intermediate states {7} in 
the neutron-to-neutron matrix element. For intermediate baryon states, the matrix 
elements of Y ysy are dimensionless numbers of order unity and magnetic moment 
effects are of order the nucleon magneton, un. Thus, we find for de, 


M,MqmMs CbLn 


de ~ 8 (4.11) 


3 
My,Mq T MyMs T MqaMs AM 


where AM is some typical energy denominator. Using AM = 300 MeV, we obtain 
de ~ 8 x 107" e-cm. (4.12) 


Far more sophisticated methods have been used to calculate this, with results that 
have a spread of values [EnRV 13]. Our simple estimate is near the average. In 
explicit calculations, some subtlety is required because one must be sure that the 
evaluation correctly represents the U(1),4 behavior of the theory. However, the 
precise value is not too important; the significant fact is that bounds on de <3 x 
107% e-cm require 6 to be tiny, 9<107!!. 

The strong CP problem does not have a good resolution within the Standard 
Model. It would appear that the abnormally small value of 6, and of the cosmo- 
logical constant as well, are indications that more physics is required beyond that 
contained in the Standard Model. 


Problems 


(1) Strangeness gauge invariance 
(a) Physics must be invariant under a global strangeness transformation |M) —> 
exp(iAS)|M), where A is arbitrary. Explain why this is the case. 
(b) Demonstrate that such a transformation has the effect 


Im A; Im A; Im M12 Im Mj 
=> Xr > — 


+ $ — 2), 
Re A; Re A; Re Mj Re Mj 


as claimed in Eq. (2.10), and that, while unphysical quantities such as €, & 
are affected by such a change, physical parameters such as €, €’ are not. 
(2) Neutral kaon mass matrices and CPT invariance 
Some of the ideas discussed in this chapter can be addressed in terms of simple 
models of the neutral kaon mass matrix M which appears in Eq. (1.2). 


(a) 


(b) 


(c) 
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Consider the following CP-conserving parameterization as defined in the 


(K°, K?) basis: 
mo A 
Mo, = 
i ( i E 


where A is real-valued. Determine the basis states (K_, K+) in which 
Mo — M+ becomes diagonal and obtain numerical values for mọ, A. 
Working in the (K_, K) basis, extend the model of (a) to allow for CP 
violation by introducing a real-valued parameter ô, 


m- 0 ,_(ù(m- —iô 
m=(" w) = ma! = ("5 0). 


and assume there is no direct CP violation. This mass matrix corresponds 
to the superweak (SW) model. By expressing M4’ in the (K°, K?) basis, 
use the analysis of Sects. IX-1,2 to predict pS”) = phase e and determine 
ô from the measured value of |e]. 

Finally, extend the model in (b) to 


M. n = m_ x 
= x* m, , 


where Re x is a T-conserving, CP-violating, and CPT-violating parameter. 


Show that the states which diagonalize M4” are 


X 
|Ks) ~ E — lh 


x* 
(Kik ~ |K-) + F144), 


where D = (mr —ms)/2+iT s/4. Then evaluate n+- and noo, allowing for 
the presence of direct CP violation (i.e. €" 4 0), and derive the following 
relation between phases, 


(SW) 
pas 


1 |m —mxo| . 
= SE sing 


2 1 
xi (5e + Fow- A 2m xo m; —Ms 


The result |mzo—m xo|/mxo < 5x107 18 which follows from this relation, 
is the best existing limit on CPT invariance. 


X 


The N7! expansion 


The N7! expansion is an attempt to create a perturbative framework for QCD 
where none exists otherwise. One extrapolates from the physical value for the 
number of colors, Ne = 3, to the limit Ne — oo while scaling the QCD cou- 
pling constant so that g3N, is kept fixed [’tH 74]. The amplitudes in the theory are 
then analyzed in powers of N~!. The hope is that the Ne —> oo world bears suffi- 
cient resemblance to the real world to yield significant dynamical insights. There 
is no magical process which makes the Ne — oo theory analytically trivial; non- 
linearities of the nonabelian gauge interactions are present, and the theory is still 
not solvable. However any consistent approximation scheme for QCD is welcome, 
and the large N. expansion is especially useful for organizing one’s thoughts in the 
analysis of hadronic processes. 


X-1 The nature of the large N, limit 


In passing from SU (3) to SU(N), the quark and gluon representations, originally 
3 and 8, become Ne and N? — 1 respectively. The analysis of Feynman graphs at 
large N. is simplified by modifying the notation used to describe gluons. As usual, 
quarks carry a color label j, with j = 1,2,..., Ne. Gluons can be described by 
two such labels, i.e. 


a k J 
Ay > Ay (Aj; = 9), (1.1) 
where a = l,..., N? — l and j,k = 1,..., N.. In doing so, no approximation 


is being made. The new notation is simply an embodiment of the group product 
Ne x Ne > N? — 1) @1. The quark-gluon coupling is then written 


sw y AS, (1.2) 
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(a) (b) (c) 


(d) (e) 


Fig. X—1 Double-line notation: (a) quark and (b) gluon propagators, (c) quark- 
gluon, (d) three-gluon, and (e) four-gluon vertices. 


and the gluon propagator is 
fe e'f*(0|T (Ai (x)A,,(0))| 0) = (6/84 — No'i) i Duala) 3) 


The term proportional to N7! must be present to ensure that the color singlet com- 
bination vanishes, A : ; = 0. However, as long as we avoid the color-singlet chan- 
nel, this term will be suppressed in the large N, limit and may be dropped when 
working to leading order. 

Using this new notation, the Feynman diagrams for propagators and vertices are 
displayed in Fig. X—1. A solid line is drawn for each color index, and each gluon 
is treated as if it were a quark—antiquark pair (as far as color is concerned). In this 
double-line notation, certain rules which are obeyed by amplitudes to leading order 
in 1/N. emerge in an obvious manner. Although general topological arguments 
exist, we shall review these rules by examining the behavior of specific graphs. The 
power of Feynman diagrams to build intuition is rather compelling in this case. 

We consider first the familiar quark and gluon propagators. The quark propaga- 
tor, unadorned by higher-order corrections, is O(1) in the Ne — oo limit. Fig. X-2 
depicts two radiative corrections. Fig. X—2(a), the one-gluon loop, is O(1) in pow- 
ers of N, because the suppression from the squared coupling g3 is compensated 
for by the single closed loop, which corresponds to a sum over a free color index 
and thus contributes a factor of N.. The graph then is of order 83 Ne, which is taken 
to be constant. The graph Fig. X-2(b) with overlapping gluon loops is O(N”) 
because, with no free color loops, it is of order gj = (82N) N~? ~ N-?. The 
terms planar and nonplanar are used, respectively, to describe Figs. X—2(a),(b), 
because the latter cannot be drawn in the plane without at least some internal lines 
crossing each other. 

Four distinct contributions to the gluon propagator are exhibited in Fig. X—3. 
Figs. X—3(a),(b) depict in double line notation the quark—antiquark and two-gluon 


iON! ana. 


(a) (b) 


Fig. X—2 Radiative corrections to the quark propagator: (a) planar, (b) nonplanar. 


280 The N~! expansion 
-O -O- -@- 
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Fig. X-3 Various radiative corrections to the gluon propagator. 


loop contributions. It should be obvious from the above discussion that these are 
respectively O(N!) and O(1). A new diagram, involving the three-gluon cou- 
pling, appears in Fig. X—3(c). With three color loops and six vertices, it is of order 
(NY = O(1). Figure X—3(d) is a nonplanar process with six vertices and one 
color sum, and is thus O(N’). 

The discussion of the gluon propagator indicates why we constrain g3N- to be 
fixed when taking the large N. limit. The beta function of QCD is determined to 
leading order by Figs. X-3(a),(b). If gj were held fixed, the beta function would 
become infinite in the large N, limit, leading to the immediate onset of asymptotic 
freedom. The choice ZN. ~ constant leads to a running coupling constant and is 
compatible with the behavior for the realistic case of N, = 3. 

To summarize, there are several rules which can be abstracted from examples 
such as these: (i) the leading-order contributions are planar diagrams containing 
the minimum number of quark loops; (ii) each internal quark loop is suppressed 
by a factor of N~'; and (iii) nonplanar diagrams are suppressed by factors of N-?. 
The suppressions in rules (ii), (iii) are combinatorial in origin. Quark loops and 
nonplanarities each limit the number of color-bearing intermediate states, and con- 
sequently cost factors of N7!. 


X-2 Spectroscopy in the large N, limit 


In order for the large N, limit to be relevant to the real world, it must be assumed 
that confinement of color-singlet states continues to hold. In this case, we expect 
the particle spectrum to continue to be divided into mesons and baryons. Let us 
treat the mesons first. 

One can form color-singlet meson contributions from Q Q pairs. To form a color 


singlet, one must sum over the quark colors. In order to produce a properly nor- 
1/2 « 


malized QO state one must therefore include a normalization factor of N. /~ into 
each QO meson wavefunction, such that 
z 1 , 
a a B)+ 
QO cor ~ —— bO ag?" 0}, (2.1) 
| singlet JN 
where g, 6 are flavor labels, i = 1,..., Ne is the color label, and b (dÝ) are 


the quark (antiquark) creation operators. Meson propagators, as represented in 
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Fig. X-4 Mesons in the double line notation. 


Fig. X—4(a), are then O(1) in Ne since the factors of (Ne = 22 from the normaliza- 
tion of the wavefunction are compensated by a factor of N, from the quark loop. 
This leads to the prediction that meson masses are of O(1) in the large Ne limit, 
i.e., they remain close to their physical values. Multiquark intermediate states, as in 
Fig. X-4(b), are suppressed by 1/N,, indicating a suppression of mixing between 
QO and Q? Q? sectors. That is, large No plus confinement implies the existence of 
QO mesons which contain an arbitrary amount of glue in their wavefunction, but 
which do not mix with Q? Q? states. 

The quark content of a given hadron remains an issue of some theoretical and 
phenomenological interest. Several examples are given in Sect. XIII—4 of hadron 
states which are thought to be ‘nonconventional’. One such is the o hadron, which 
is the lightest resonance found in zz scattering. Analysis has yielded insight as to 
the N, dependence of the o mass (cf. Eq. (XIII-4.7)). We reserve further comment 
on this interesting topic to Chap. XIII. 

What about the decay widths of Q Q mesons? The decay amplitude is pictured 
in Fig. X—5 (other possibilities involve the suppressed quark loops). This diagram 
contains three meson wavefunctions and one quark loop and hence is of order 
(N PPN. — |, in amplitude or N7! in rate. The large Ne limit thus involves 
narrow resonances, i.e., T/M — 0, where I is the meson decay width and M is 
the meson mass. This is reasonably similar to the real world, where most of the 
observed resonances have T/M ~ 0.1-0.2 [RPP 12]. 

Color-singlet gluonic states, called glueballs, may also exist. The normalization 
of a glueball state can be fixed by means of the following argument. Suppose, as 
will be defined in a gauge-invariant manner in Sect. XII—4, that a neutral meson 
can be created from two gluons. Then in normalizing this configuration, one must 
sum over the N? gluon color labels. As a consequence, a normalization factor N7! 
is associated with each glueball state. Glueball propagators also emerge as being 
O(1). There is no physical distinction between two-gluon states, three-gluon states, 


M2 


M, 
M3 


Fig. X-5 Strong interaction decay of a QQ meson. 
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Fig. X-6 Meson—meson scattering. 


etc., because all are mixed with each other by the strong interaction. As a result, 
there need not be any simple association between a specific physical state and 
gluon number, and thus the concept of a ‘constituent gluon’ need not be inferred. 
In glueball decays, however, one must distinguish between glueballs decaying to 
other glueballs, and those decaying to Q Q mesons. Where kinematically allowed, 
the decay of glueballs to glueballs is O(1), while that to QO states is O (1/N-). 
The lowest-lying glueball(s) will then be narrow, while those above the threshold 
for decay into two glueballs will be of standard, nonsuppressed width. 

Meson-meson scattering amplitudes are also restricted by large N, counting 
rules. Consider the diagrams of Fig. X—6. That of Fig. X—6(a) is of order (Ne 1/ a 
N: ~ Nz', whereas Fig. X-6(b) is O(N”) because of the extra quark loop. The 
scattering amplitudes thus vanish in the large N, limit, and the leading contribu- 
tions are connected, planar diagrams. 

The large N. limit also predicts that neutral mesons (i.e., Q® QP composites 
with œ = £) do not mix with each other. The possible mixing diagram is given 
in Fig. X—7, and includes any number of gluons. However, because of the extra 
quark loop, it is of order N7!, and thus vanishes in the infinite color limit. This 
means that wi states do not mix with dd or Ss, nor do the latter two mix. The large 
N, spectrum thus displays a nonet structure with the ui and dd states degenerate 
(to the extent that electromagnetism and the mass difference between the u and d 
quarks are neglected) and the ss states somewhat heavier. This pattern is reflected 
in Nature, except that the mu and dd configurations now appear as states of definite 
isospin, wu + dd. For example, let us consider the J PC = 177, 2++ mesons. 
For the former, 0(770) and w(783) are interpreted as uu, dd isospin J = | and 
I = 0 combinations, while g(1020) is the ss member of the nonet. Including the 
K*(892) doublet as the zs, ds combinations, a simple additivity in the quark mass 
would imply 


Fig. X-7 Meson—meson mixing. 


X-3 Goldstone bosons and the axial anomaly 283 
™ (1020) — M p(770) = 2 (mx*(g92) mpm) ; (2.2) 


which works well. A similar treatment of the 2*t mesons, identifying a2(1320) 
and f2(1270) as the corresponding uu, dd states and (1525) as an ss composite, 
predicts 


mM 31525) — May (1320) = 2 (™m x+(1430) = Ma(1320)) , (2.3) 


which is also approximately satisfied. The fact that o (770), w(783), f2(1270) and 
a2(1320) decay primarily to pions, and g(1020) and f§(1525) decay primarily to 
kaons, reinforces this interpretation. 

The world of baryons in the large N, limit is quite different from that of mesons 
and glueballs [DaJM 94, Je 98]. In order to form a color singlet, one needs to 
combine not three quarks but rather N. quarks in a totally color-antisymmetric 
fashion. This forces the baryon mass to grow as N., i.e., to become infinitely heavy 
in the N. — oo limit. In an attempt to model this behavior, it has been suggested 
that baryons can be associated with the soliton solution, called the Skyrmion, of a 
certain chiral lagrangian [Sk 61, Wi 83b]. We shall discuss this idea in Sect. XI-4 
in the context of a model with SU(2); x SU(2)r symmetry. 


X-3 Goldstone bosons and the axial anomaly 


As stated in the previous section, it must be assumed that color confinement con- 
tinues to hold in the large N. limit. Given this behavior, it can be proven under 
reasonable conditions that chiral symmetry is spontaneously broken [CoW 80]. In 
this circumstance, the large N, limit turns out to imply a fascinating unity between 
the 7'(960) meson and the octet of Goldstone bosons in massless QCD [Wi 79]. 
The N: = œ analog of ņ' (960) is also a Goldstone boson if quarks are massless. 

In order to see this, let us first consider the pseudoscalar decay constants. Because 
the axial-current matrix elements 


(Pj(q) |Ag (0)| 0) = —i Fjqôjr (3.1) 


involve one meson normalization factor and one quark loop, they are of order 
(N.'/?)N. ~ Nn, which then implies F; = O(N.’”). Now consider the cur- 
rent divergence in the limit of massless quarks. In general, we have 


For the octet of currents, the divergence vanishes for zero quark mass, and as usual 
leads to the identification of 7, K, ng as Goldstone bosons. However, for the singlet 
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current the anomaly is present. Even in the limit of vanishing quark mass, the 
current divergence has nonzero matrix elements, in particular, 


383 ea 
3 pa paw 
3272 WY 


WE [3.8 00} = Fama = (ne) 


o) ; (3.3) 


If one repeats the calculation of the anomalous triangle diagram as in Sect. II-3 but 
now allows JN to be arbitrary, one sees that it is proportional to Tr (ea) Se 04 
and is therefore independent of N.. However, by using large N.-counting rules, the 
matrix element in Eq. (3.3) is seen to be of order eNe” ~ NT"?! This implies 
that the gluonic contribution to the axial anomaly vanishes in the large N, limit. 
When we take into account the behavior of F,,, we conclude that m?, ~1/N.—> 0. 
The 7’ is thus massless in the large N, limit, and we end up with a nonet of 
Goldstone bosons. 

To illustrate what happens when the number of colors is treated perturbatively, 
let us consider the 1/N, corrections to the meson spectrum together with the effects 
of quark masses. If we first add quark masses, we have, in analogy with the results 
of Sect. VII—1, the mass matrix 


m;, = (P; |m(uiu + dd) + m5s| Pj), (3.4) 
where we have taken m, = my = m. This leads to a squared-mass matrix 


2m 0 0 0 


' eae (3.5) 
0 0 5m tm) 3-(m— ms) 
0 0 Gi-—ms) 2m, + 2m) 


in the basis (x, K, ng, no). If this were diagonalized, one would find an isoscalar 
state degenerate with the pion. This is a manifestation of the U(1) problem, which 
arises when there is no anomaly. However, at the next order in large N<, the matrix 
picks up an extra contribution in the SU (3)-singlet channel due to the anomaly, 
yielding 


2m 0 0 0 
0O m+ 0 0 


; (3.6) 
0 0 22m, + ñ) 2V2 — ms) 


0 0 MGh—-m) 4+2) + Z| 


! This result depends on the assumption that topologically nontrivial aspects of vacuum structure are smooth in 
the Ne — oo limit. 
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where € = O(N°). This mass matrix yields an interesting prediction. The quanti- 
ties Bom and Bom, are fixed as usual by using the x and K masses. Also the trace 
of the full matrix must yield mz +m +m; +m_,, which fixes € = 2.16 GeV’. The 
remaining diagonalization then predicts my = 0.98 GeV, m, = 0.50 GeV with a 
mixing angle of 18°. This is a remarkably accurate representation of the situation 
in the real world. Although €/ WN, is suppressed in a technical sense, note how large 
it actually is. One is hard pressed to imagine any sense in which the physical n’ 


mass can be taken as a small parameter. 


X—4 The OZI rule 


In the 1960s, an empirical property, called the Okubo—Zweig—lizuka (OZI) rule 
[Ok 63, Zw 65, Ii 66], was developed for mesonic coupling constants. Its usual 
statement is that flavor-disconnected processes are suppressed compared to those 
in which quark lines are connected. In the language which we are using here, fla- 
vor disconnected processes are those with an extra quark loop. Unfortunately, the 
phenomenological and theoretical status of this so-called rule is ambiguous. We 
briefly describe it here because it is part of the common lore of particle physics. 

The empirical motivation for the OZI rule is best formulated in the decays of 
mesons. Let us accept that (1020) and f}(1525) are primarily states with content 
ss whereas w(783) and f2(1270) have content (au + dd) / V2. Mixing between 
the ss and nonstrange components can take place with a small mixing angle, such 
that 


Amp (ss) 
Amp (tāu + da\/V2) 


= tan, (4.1) 


with 6 = 0y for the vector mesons and 6 = Or for the tensor mesons. In both 
cases, 0 is small. Experimentally, the g and f, decay dominantly into strange par- 
ticles even though phase space (abbreviated as ‘p.s? below) considerations would 
strongly favor nonstrange modes, 


Vossrto 9.18 Tpotnton ~ 9 99 
eg T os3r í 
Dp sax Pp sax (4.2) 
— = 0.012 + 0.002 — = 0.004 + 0.001 ` 
Typ Kk fooun 
~ 0.003 x p.s., ~ 0.002 x p.s. 


This suggests the hypothesis ‘ss states do not decay into final states not containing 
strange quarks’. Diagrammatically this leads to a pictorial representation of the 
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M M 


(a) (b) 
Fig. X-8 OZI (a) allowed, (b) forbidden amplitudes. 


OZI rule, viz., the dominance of Fig. X—8(a) over Fig. X—8(b). Some scattering 
processes also show such a suppression. For example, we have 


EBON 2) (16) (4.3) 
On- pon 
which can be interpreted as an OZI suppression. A stronger version of the OZI 
tule would have the g/w and f3/fz ratios equal to a universal factor of tan? 6 
(cf. Eq. (4.1)) once kinematic phase space factors are extracted. 

The narrow widths of the J/y and Y states are also cited as evidence for the 
OZI rule, since these hadronic decays involve the annihilation of the cé or bb con- 
stituents. This can be correct almost as a matter of definition, but it is not very 
enlightening. Indeed, the small widths of heavy-quark states can be understood 
within the framework of perturbative QCD without invoking any extra dynamical 
assumptions. However, perturbative QCD certainly cannot explain the OZI rule in 
light mesons. It must have a different explanation for these states. 

There actually exist several empirical indications counter to the OZI rule [Li 84, 
EIGK 89, RPP 12]. Among the more dramatic examples of OZ/-forbidden reac- 
tions, expressed as ratios, are 


r sonta- => pontn 
2 Jip ont 12405: Ovp> penta =F 0407. 


Vy /w—+oK+K- Oy p> poK* K- (4 4) 
(oo "1 O. EM l 
OOA a P > 5 (90% CL). 

On- p— fy n Oy p> ppK*K- 


The universal-mixing model is incorrect more often than not, with counterexamples 
being 


T >gortr PERT” 
6 4 0s, i 0.02, 
Py jysontn- Oyp> pont n- (4.5) 


= 0.029" 675 


3 


Opp- fjntn- 


Opp>fnrtr 
instead of the values 0.03, 0.03, and 0.006 expected from the previous ratios. The 
empirical 7—n’ mixing angle 6, ~ —20° also violates the OZI rule, which would 
require a mixing angle of —35°. 
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There is also an intrinsic logical flaw with the simplest formulation of the OZI 
rule. This is because OZ/J-forbidden processes can take place as the product of 
two OZ]-allowed processes. For example, each of the following transitions is OZI- 
allowed: 


fp > KK, KK xn, 


/ (4.6) 
fh >m, nn > ar. 
Hence the OZ/-forbidden reaction f; —> mz can take place by the chains 
fp > KK > xx, fı > m> rr. (4.7) 


These two-step processes are in fact required by unitarity to the extent that the 
individual scattering amplitudes are nonzero. 

The large N. limit provides the only known dynamical explanation of the OZI 
rule at low energies. Although the gluonic coupling constant is not small at these 
scales and suppressed diagrams have ample energy to proceed, they are predicted 
to be of order 1 /N? in rate because of the extra quark loop. Yet large N, arguments 
need not suggest a universal suppression factor of tan? 6, because there is no need 
for the 1/N, corrections to be universal. Note that the large N. framework also 
forbids the mixing of 7 and 7’ and, more generally, the scattering of mesons. 

Thus, the OZ/ rule in light-meson systems remains somewhat heuristic. It has 
a partial justification in large N, counting rules, but it also has known violations. 
It is not possible to predict with certainty whether it will work in any given new 
application. 


X-5 Chiral lagrangians 


The large N. limit places restrictions on the structure of chiral lagrangians 
[GaL 85a]. To describe these, we must first allow for an enlarged number Ny > 3 
of quark flavors. The three-flavor O(E*) lagrangian is expanded as 


10 
Lamy TO, (5.1) 
i=l 
where the {0;} can be read off from Eq. (VII-2.7). Recall that in constructing £4, 
we removed the O(E*) operator 
Oo = Tr (D,UD,U' D“UD'U"), (5.2) 


because for Np = 3 it is expressible (cf. Eq. (VII-2.3)) as a linear combination of 
O},2,3. However, if the number of flavors exceeds three, one must append Op to the 
lagrangian of Eq. (5.1), 
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10 3 10 
L=} L0: — J B0 +) LiO:. (5.3) 
i=l i" 7=0 i=4 


In view of the linear dependence of Op on O1,2,3, note that we have needed to 
modify the coefficients L1,2,3 —> B1,2,3. Upon returning to three flavors, we regain 
the original coefficients, 


B 
Li= -tB Lz = Bo+ Bo, L3 = —2Bo + Bs. (5.4) 


We can now study the large N, behavior of the extended O(E*) chiral lagrangian. 
The distinguishing feature is the number of traces in a given O(E*) operator. Each 
such trace is taken over flavor indices and amounts to a sum over the quark flavors, 
which in turn can arise only in a quark loop. In particular, those operators with two 
flavor traces (O1,2,4,6,7) will require at least two quark loops, while those with one 
flavor trace need only one quark loop. However, our study of the large N. limit 
has taught us that every quark loop leads to a 1/N, suppression. Thus, the O(E*) 
chiral contributions having two traces will be suppressed relative to those with one 
trace by a power of 1/N,, and provided B3 Æ 0 we can write? 

Bı B2 La Le 


B3 B3 L3 B3 Ne>0o 


Alternatively, this N.-counting rule implies (provided Bo/ B3 # 1/2) for the {B;} 
coefficients of flavor SU (3), 
2Lı—-L) L4 L 


6 = 
= = — z . b 
Iz L; L; O(N. ) (5.5b) 


The overall power of N, for the remaining terms can be found by noting that the 
m7 scattering amplitude should be of order N-', implying L123 = O(N-). 

The only exception to the above counting behavior is the operator with coef- 
ficient L7. This exception occurs because the operator can be generated by an n’ 
pole, and the 7’ mass-squared is O (1/N-.). In particular, the coefficient of this term 
is absolutely predicted in the large N, limit. This follows if we include the large Ne 
result for mixing between 77 and 7’ shown in Eq. (3.6) as a chiral lagrangian 


F. 
La = — n Tr (xU — U*x), 5.6 
nn age" (x x) (5.6) 


which when expanded to order nong will yield the off-diagonal term in the mass 
mixing matrix of Eq. (3.6). Integrating out the no ~ n’ leads to the effective 
lagrangian 


2 The operator O7 presents a special case and is discussed below. 
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AR m2 
48 my 


Lest = [Tr (x*U —U*x)]. (5.7) 


It is the factor of m, which overcomes the counting rules. Although the dou- 
ble trace suggests that this operator is suppressed in the large N, limit, we have 
ey & Ne. Thus, at least formally, an extra enhancement would be predicted. 

The large Ne limit then predicts the following ordering of the chiral coefficients 
in L4: 


m 


Lı = O(N?), 
Ly, L2, L3, Ls, Lg, Lo, Lio = OW), 
2Lı — L2, L4, L= 0 (1). (5.8) 


An existing empirical test involves the occurrence of 2L; — L in K > mmev, 
decays [Bi 90, RiGDH 91], and the prediction works quite well. The large Ne 
enhancement of L7 is probably just a curiosity in that the physical value of the 7’ 
mass is not small compared to other masses in the theory, and hence the technical 
advantage of m, œ N7! is probably not useful phenomenologically. 


Problems 


(1) The large N, weak hamiltonian 
Retrace the calculation of the QCD renormalization of the weak nonleptonic 
hamiltonian described in Sect. VIII—-3, but now in the limit Ne —> oo with 
g3N- fixed. Show that the penguin operators do not enter and that all short- 
distance effects are of order N7 ', with the operator-product coefficients c; = 1, 
C2 = 1/5, C3 = 2/15, C4 = 2/3, C5 = C6 = 0. 

(2) The strong CP problem in the large N, limit 
In the large N. limit, the no can be united with the Goldstone octet in the 
effective lagrangian. Generalizing the chiral matrix to nine fields we write 
L = Lo + Lyi where 


F? pA FP — 
Lo = Tr (a,.00"0") + — Bo Tt (mG + 0%), 
F? z -+ 72 
L= Ttt], 


v= DN F ee 
= exp (ià -ọ/F) exp{i ron i 
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(a) Confirm that this reproduces the mixing matrix of Eq. (3.6). 
(b) Another way to obtain this result is to employ an auxiliary pseudoscalar 
field g(x) (with no kinetic energy term) to rewrite £ nr! as 
Ey a(x) $ seal (InU —In U*), 
Identify the SU(3)-singlet axial current and calculate its divergence to 
show that q(x) plays the same role as FF, i.e., q(x) ~ aF F/8x. 
Integrate out g(x) to show that this is equivalent to the form of part (a). 


(c) Several authors [RoST 80, DiV 80] suggest adding the 0 term through 


wa 


L = Lo + Lya — 6q(x). 


From this starting point, integrate out q(x) and show that a chiral rotation 
can transfer 0 to arg(det m). However, in the sense described in 
Sect. IX—4, this theory is unstable about U = 1. The stable vacuum corre- 
sponds to in = ô; exp(iB;). For small 0, solve for 6; in terms of 6. 

(d) Using U = e'6/2Ue'b/?, define the fields about the correct vacuum to find 
the CP-violating terms of the form 


Lcr = i0 [a Tr (U — U') + bTr (nU — InU')], 


identifying a and b and showing they vanish if any quark mass vanishes. 
Calculate the CP-violating amplitude for n > atx. 


XI 


Phenomenological models 


QCD has turned out to be a theory of such subtlety and difficulty that a concerted 
effort over an extended period has not yielded a practical procedure for obtaining 
analytic solutions. At the same time, vast amounts of hadronic data which require 
theoretical analysis and interpretation have been collected. This has spurred the 
development of accessible phenomenological methods. We devote this chapter to 
a discussion of three dynamical models (potential, bag, and Skyrme) along with a 
methodology based on sum rules. Although the dynamical models are constructed 
to mimic aspects of QCD, none of them is QCD. That is, none contains a rigor- 
ous program of successive approximations which, for arbitrary quark mass, can be 
carried out to arbitrary accuracy. Therefore, our treatment will emphasize issues 
of basic structure rather than details of numerical fits. By using all of these meth- 
ods, one hopes to gain physical insight into the nature of hadron dynamics. Despite 
its inherent limitations the program of model building, fortified by the use of sum 
rules, has been generally successful, and there is now a reasonable understanding 
of many aspects of hadron spectroscopy. 


XI-1 Quantum numbers of QQ and Q? states 


Among the states conjectured to lie in the spectrum of the QCD hamiltonian are 
mesons, baryons, glueballs, hybrids, dibaryons, etc. However, since practically all 
currently known hadrons can be classified as either QO states (mesons) or Q? 
states (baryons), it makes sense to focus on just these systems. We shall begin by 
determining the quark model construction of the light-hadron ground states. Much 
of the material will be valid for heavy-quark systems as well. 


Hadronic flavor—-spin state vectors 


In many respects, the language of quantum field theory provides a simple and 
flexible format for implementing the quark model. Let us assume that for any given 
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dynamical model, it is possible to solve the field equations of motion and obtain 
a complete set of spatial wavefunctions, {Wa (x)} for quarks and {wg(x)} for anti- 
quarks, where the labels œ and @ refer to a complete set of observables. A quark 
field operator can then be expanded in terms of these wavefunctions, 


Wx) = > [Wanye ba) + Waxed" E], (1.1) 


a 


where œx, Wa are the energy eigenvalues, b(œ) destroys a quark and d‘(@) creates 
the corresponding antiquark. The quark creation and annihilation operators obey 


{b(a), b} = Sua, {4@), d"(@’)} = Saw, 
{b(a), b(a’)} =0, {d(@), d@’)} = 0, 
{b(a), d'(@’)} =0, (1.2) 


which are the usual anticommutation relations for fermions. 

In all practical quark models, an assumption is made which greatly simplifies 
subsequent steps in the analysis, that the spatial, spin, and color degrees of freedom 
factorize, at least in lowest-order approximation. This is true provided the zeroth- 
order hamiltonian is spin-independent and color-independent. Spin-dependent inter- 
actions are then taken into account as perturbations. This assumption allows us to 
write the sets {œ} and {œ} in terms of the spatial (n), spin (s, m,), flavor (q), and 
color (k) degrees of freedom respectively, i.e., a = (n, s, Mms, q, k). If we are con- 
cerned with just the ground state, we can suppress the quantum number n, and for 
simplicity replace the symbols b, dř, etc., for annihilation and creation operators 
with the flavor symbol q (q =u, d, s for the light hadrons), 


bi(n = 0, q, ms, k) > ake 
d'n =0,9, ms.) > Fim (1.3) 


Hadrons are constructed in the Fock space defined by the creation operators 
for quarks and antiquarks. Light hadrons are labeled by the spin (S?, S3), isospin 
(T?, T3), and hypercharge (Y) operators as well as by the baryon number (B). 
Other observables like the electric charge Qe and strangeness S are related to these, 


Qa = T3 + Y/2, S=Y-—B. (1.4) 


Since quarks have spin one-half, the baryon (Q*) and meson (Q Q) configurations 
can carry the spin quantum numbers S = 1/2, 3/2 and S =0, 1, respectively. If we 
neglect the mass difference between strange and nonstrange quarks, then flavor 
SU (3) is a symmetry of the theory, and both quarks and hadrons occupy SU (3) 
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Fig. XI-1 Some SU(3)-flavor representations. 


multiplets. The quarks are assigned to the triplet representation 3 and the antiquarks 
to 3*. The QQ and Q? constructions then involve the group products 


3x3*=861, 
(3x 3)x3= (603°) x3=100890861, (1.5) 


so that baryons appear as decuplets, octets, and singlets whereas mesons appear 
as octets and singlets. The SU (3)-flavor representations 3, 3*, 8, 10 are depicted 
in Y vs. T; plots in Fig. XI-1. The circle around the origin for the eight-dimensional 
representation denotes the presence of two states with identical Y, /; values. Finally, 
quarks and antiquarks transform as triplets and antitriplets of the color SU (3) 
gauge group, and all baryons and mesons are color singlets. 

Two simple states to construct are the p] meson and the A35 baryon, 


1 
+) 
lon) = A 


. —} 1 fe de 
windigl0), (A373) = Geijam iti tes 0), (1.6) 


where the superscript and subscript on the hadrons denote electric charge and 
spin component, and a summation over color indices for the creation operators is 
implied. The normalization constants are fixed by requiring that the hadrons {H,} 
form an orthonormal set, (Hm| Hn) = mn. The other ground-state hadrons can be 
reached from those in Eq. (1.6) by means of ladder operations in the spin and fla- 
vor variables. In this manner, one can construct the flavor—spin—color state vec- 
tors of the O~ octet and singlet mesons and the octet baryons displayed in 
Tables XI-1 and XI-2. 

A convenient notation for fields which transform as SU (3) octets involves the 
use of a cartesian basis rather than the ‘spherical’ basis of Tables XI-1,2. In fact, 
we have already encountered this description in Sect. VII-1 during our discussion 
of SU(3) Goldstone bosons where the quantity U = exp(ig - A) played a central 
role. The eight cartesian fields {p4} are related to the usual pseudoscalar fields by 
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Table XI-1. State vectors of the pseudoscalar octet and 
singlet mesons. 


+ —+ 
uld, —u} ai] |0) 
~l ytz 
x ) = ya lh — d} m},] 10) | 
0, _ _1 at t7 
n°) = J [ui E, +u yah + dhdi, — dydi] 10) 


K+) = Fe uhh- Hii] 10} 
K?) = Fe shi, — s di4] 10) 
K”) =- [sj T}, — s}, T] 10) 
K~)= J lasi, dish] |0) 
m8) = ag lei i o -dj C 28h, +2515] 10) 


p 
m) = i [u} Uiui, — Uj Uig + didi, dj al + 5)45 h= Shy 314] 10) 


+ 1 
mo = (oi Fig), T =p, Ns = 9s, 
+ 1 , 
KY = va Figs), 
1 —0 1 
K? = —<(y6 = ig), K = (yo + ig), (1.7) 
J Po — 197 J p 97 
which is an alternative way of stating the content of Eq. (VII-1.12). The physical 
spin one-half baryons p,n,... can likewise be expressed in terms of an octet of 
states {B;} (i =1,..., 8) in cartesian basis as 
+ 1 . 
D = gA Fi Ba), X’ = B, A = Bs, 
l (B4 — i B5) : (Bo — i B7) 
P = —1 ’ n= 671 ’ 
a.” v2 ' 
1 , ae 1 : 
a° = g” + iB37), a7 = ge +iBs). (1.8) 


In the quark model, hadron observables have simple interpretations, e.g., the 
baryon number is simply one-third the difference in the number of quarks and 
antiquarks, etc. Thus, writing quark and antiquark number operators as N (q) and 
N(q) for a quark flavor g, we have 


B = [N (u) + N@) + Ns) — N) — N(d) — N(5)]/3, 
T3 = [N (u) — N(d) — N (u) + N(d)]/2, 
= [N (u) + N(d) — 2N (s) — N (u) — N (d) + 2N(5)]/3, 
Qa = [2N (u) — N (d) — N (s) — 2N (0) + N(d) + N(S)]/3, (1.9) 
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Table XI-2. State vectors of baryon 
spin-one-half octet. 


— Eijk i i) 
Pr) = Jis lu "idj — ujdi] ugy 10) 
ny) = Fg [dine — dj u'y] dey 10) 
€ij t - t+ 7.7 
Ay) = 5 lund; - udial Skt 10) 
A ae oe x ae 
£?) = sie [sial ul, + shd yul | -25) dlia] |0) 
=) = £k jst d? — st dt 
x) a Tie [sidj 5; 44] t |0) 
= ij Í 
Bt) = H sipuh — siig 1 sky 10) 
m—\ _ “ijk ij t 
E7) = SX [shd — sidt] s}, 10) 


and the hadronic spin operator is 


+ (O ) mms 
S = a, dim! - dims» (1.10) 
q 


2 


Quark spatial wavefunctions 


Many applications of the quark model require the knowledge of the quark spatial 
wavefunctions within hadrons. It is here that the greatest variation in the differ- 
ent models can occur, but several general features still remain. Indeed, in many 
instances it is the general features that are primarily tested. 

For example, the ground state in all models is a spatially symmetric S state in 
which the wavefunction peaks at r =0. The normalization condition of the quark 
spatial wavefunction, 


fas Ww (x)W(x) = 1, (1.11) 


ensures that the magnitude of y will be similar in those models having wavefunc- 
tions of comparable spatial extent. This accounts for the agreement which can be 
found among diverse quark models in specific applications. How does one fix the 
spatial extent? One approach is to use an observable like the hadronic electromag- 
netic charge radius, e.g., 


(r?) — 0.87 + 0.02 fm, (r?) 12 = 0.66 + 0.02 fm. (1.12) 


proton pion 
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Fig. XI-2 Quark probability density in the bag and oscillator models. 


Viewed this way, the bound states are seen to define a scale of order 1 fm. For exam- 
ple, we display two models in Fig. XI-2, the oscillator result with a? = 0.17 GeV? 
and the bag profile, which are each obtained by fitting to ground-state baryon 
observables like the charge radius. Not surprisingly, their behaviors are quite simi- 
lar. Also shown in Fig. XI-2 is an oscillator model wavefunction whose parameter 
(a? = 0.049 GeV”) was determined by using data from decays of excited hadrons. 
The difference is rather striking, and serves to demonstrate that the most impor- 
tant general feature in setting the scale in quark model predictions of dimensional 
matrix elements is the spatial extent of the wavefunction. ! 

Another aspect of quark wavefunctions involves the issue of relativistic motion. 
A relativistic quark moving in a spin-independent central potential has a ground- 
state wavefunction of the form 


_(iufr)x —iEt 
Wena(x) = (a x) ge (1.13) 


where u, £ signify ‘upper’ and ‘lower’ components. As we shall see, in the bag 
model these radial wavefunctions are just spherical Bessel functions. The above 
form also appears in some relativized harmonic-oscillator models, which use a cen- 
tral potential. If we allow for relativistic motion, then the major remaining differ- 
ence in the quark wavefunctions concerns the lower two components of the Dirac 
wavefunction. Nonrelativistic models automatically set these equal to zero, while 
relativistic models can have sizeable lower components. Which description is the 


1 We could obtain a bag result which behaves similarly by employing a charge radius of 0.5 fm rather than the 
1 fm value shown. 
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correct one? Quark motion in light hadrons must be at least somewhat relativistic 
since quarks confined to a region of radius R have a momentum given by the uncer- 
tainty principle,” 


p> V3R7'!~342MeV (for RX 1 fm). (1.14) 


Since this momentum is comparable to or larger than all the light-quark masses, 
relativistic effects are unavoidable. A more direct indication of the relativistic 
nature of quark motion comes from the hadron spectrum. Nonrelativistic systems 
are characterized by excitation energies which are small compared to the con- 
stituent masses. In the hadron spectrum, typical excitation energies lie in the range 
300-500 MeV, again comparable to or larger than light-quark masses. Such con- 
siderations have motivated relativistic formulations of the quark model. 


Interpolating fields 


In the LSZ procedure (App. B-3) for analyzing scattering amplitudes the central 
role is played by interpolating fields. These are the quantities which experience the 
dynamics of the theory in the course of evolving between the asymptotic in-states 
and out-states. They turn out to be also useful as a kind of bookkeeping device. That 
is, one way to characterize the spectrum of observed states is to use operators made 
of appropriate combinations of quark fields y(x). For example, corresponding to 
the meson sector of QQ states, one could employ a sequence of quark bilinears, 
the simplest of which are 


vy, wysy, Wr, WYuys W, WOW sive (1.15) 


Any of these operators acting on the vacuum creates states with its own quantum 
numbers. The lightest states in the quark spectrum will be associated with those 
operators which remain nonzero for static quarks, i.e., with creation operators and 
Dirac spinors of the form 


0 aay m 4 
y~ (an, Y~ Jeh (1.16) 


Only the pseudoscalar operators Wysw, Yyoysy and the vector operators Yy; y, 
WoviW are nonvanishing in this limit. All the other operators have a nonrelativ- 
istic reduction proportional to spatial momentum, indicating the need for a unit of 
orbital angular momentum in forming a state. 

The interpolating-field approach is particularly useful in situations where the 
imposition of gauge invariance determines whether a given field configuration can 
occur in the physical spectrum. We shall return to this point in Sect. XIH—4 in the 


2 The v3 factor is associated with the fact that there are three dimensions. 
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course of discussing glueball states. We now turn to a summary, carried throughout 
the next three sections, of various attempts to model the dynamics of light-hadronic 
states. 


XI—2 Potential model 


The potential model posits that there is a relatively simple effective theory in which 
the quarks move nonrelativistically within hadrons. In the light of our previous 
comments on relativistic motion, this would seem to be acceptable only for truly 
massive quarks like the b quark and certainly questionable for the light quarks 
u, d, s. However, in the potential model it is assumed that QCD interactions dress 
each quark with a cloud of virtual gluons and quark—antiquark pairs, and that the 
resulting dynamical mass contribution is so large that quarks move nonrelativistic- 
ally. These ‘dressed’ degrees of freedom are called constituent quarks, and their 
masses are called ‘constituent masses’. Constituent masses are not to be directly 
identified with the mass parameters occurring in the QCD lagrangian.° Energy lev- 
els and wavefunctions are then obtained by solving the nonrelativistic Schrédinger 
equation in terms of the constituent masses and some assumed potential energy 
function. 

The potential model is not without flaws. For light-quark dynamics, it is far 
from clear that a static potential can adequately describe the QCD interaction. 
Even with the use of constituent masses, one finds from fits to the mass spectrum 
and/or the charge radius that quark velocity is nevertheless near the speed of light 
(cf. Prob. XI-1). Also, although it is possible [LeOPR 85] to make a connection 
between the lightest pseudoscalar mesons as Goldstone bosons on the one hand and 
QQ composites on the other, this is not ordinarily done. Such criticisms notwith- 
standing, the nonrelativistic quark model does provide a framework for describing 
both ground and excited hadronic states, and brings a measure of order to a spec- 
trum containing hundreds of observed levels. Besides, virtually all physicists are 
familiar with the Schrödinger equation, and find the potential model to be an under- 
standable and intuitive language. 


Basic ingredients 


One begins by expressing the mass M, of a hadronic state œ as 


Mu=)>_ MACE, (2.1) 


l 


3 We shall continue to denote the QCD mass parameter of quark q; as m;, and shall write the corresponding 
constituent mass as M;. 
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where the sum is over the constituent quarks and antiquarks in œ. The internal 
energy E, is an eigenvalue of the Schrödinger equation 


H Ya = Ea Ya, (2.2) 
with hamiltonian 
1 
H = 2, m + = Vootor (Tij), (2.3) 
f i<j 


where r;; = r; — rj, and the subscript ‘color’ on the potential energy indicates 
that the dynamics of quarks necessarily involves the color degree of freedom in 
some manner. It is standard to assume that the potential energy is a sum of two- 
body interactions. Although there exists no unique specification of the interquark 
potential V.o1o, from QCD, the following features are often adopted: 


(1) a spin-and flavor-independent long-range confining potential, 
(2) a spin-and flavor-dependent short-range potential, 

(3) basis mixing in the baryon and meson sectors, and 

(4) relativistic corrections. 


We shall discuss specific models of the potential energy function in Sect. XIII-1. 
They all have in common the color dependence in which the two-particle potential 
is twice as strong in mesons as it is in baryons, 


V(r;;) (mesons), 


Voolor (rij) = | (2.4) 


5 V(r;;) (baryons). 

We shall describe a simple empirical test for such behavior at the end of this 
section. To appreciate its theoretical basis, note that the quark—antiquark pair in 
a meson must occur in the 1 representation of color, whereas any two quarks in a 
baryon must be in a 3* representation (in order that the three-quark composite be a 
color singlet), 


F'(3)- F(3*) (mesons), 


Veolor xX 
F(3)- F(3) (baryons), 


2 = 2 = 2(2* pes 
H (1) — F? (3) — F2(3*))/2 = —4/3 (mesons), ae 


(F? (3*) — 2F?(3))/2 = —2/3 (baryons), 
where F“(R) is a color generator for SU (3) representation R. Thus, the color 


dependence in Eq. (2.4) is that which one would naturally associate with the inter- 
action between two quarks or a quark—antiquark pair. 
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Table XI-3. Quantum numbers of 
QQ composites. 


L Singlet Triplet 

0 1 So(07+) 3S1077) 

1 1P A+) "Pai oO" 14+, 24+) 

2 'D2(2-7) "Digs .2 3) 

3 133+) 3 Fy 3, 4(2t+, 3++, 4++) 
Mesons 


For the two-particle Q Q system, it is straightforward to remove the center-of-mass 
dependence. In the center-of-mass frame the Schrédinger equation becomes 


2 
& 4 vin) Volt) = Ea Walt). (2.6) 


where r=rg — rọ and M L= M7ī' + M7' is the inverse reduced mass. The LS 
coupling scheme is typically employed to classify the eigenfunctions of this prob- 
lem. One constructs the total QQ spin, S = Sọ + Sg, and adds the orbital angular 
momentum L to form the total angular momentum J = S + L. There is an infinite 
tower of eigenstates, each labeled by the radial quantum number n and the angular 
momentum quantum numbers J, Jz, L, S. 

The QQ states are sometimes described in terms of spectroscopic notation 
2S+1T (JPC), where P is the parity and C is the charge conjugation, 


P= (=. C= (=, (2.7) 


Strictly speaking, although only electrically neutral particles like 7° can be eigen- 
states of the charge conjugation operation, C is often employed as a label for 
an entire isomultiplet, like m = (a+, 2°,2~). The lowest QO orbital configu- 
rations, expressed in *°+!L,(J?°) notation, are displayed in Table XI-3. The 
0+, 17,2+,... series of J? states is called natural, and has the same quantum 
numbers as would occur for two spinless mesons of a common intrinsic parity. The 
alternate sequence, 0-,1*,27,... is referred to as unnatural. There are a num- 
ber of J? configurations, called exotic states, which cannot be accommodated 
within the QQ construction. For example, the 077 state is exotic because any state 
with J =0 must have L = S, and according to the Q O constraint of Eq. (2.7) must 
therefore carry C = +. Likewise, the CP= — 1 sequence 0+7, 17t, 2+7,... is 
forbidden because the Q Q model requires CP = (—)**!, implying S = 0 and hence 
J=L. Thus, one would obtain P=(—)/+! in the QO model and not 
P=(-)’. 
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Baryons 


Most applications of the quark model for Q? baryons involve the light quarks. If, 
for simplicity, we assume degenerate constituent mass M, the Schrédinger equa- 
tion is 
1 3 
Hy = 57 am DP + D V (ry), (2.8) 
i<j 


where the prefactor of 1/2 in the potential energy term follows from Eq. (2.4). It is 
convenient to define a center-of-mass coordinate R and internal coordinates à and 


p by 
= (rı + r2 + r3)/3, 
p = tı = r2)/ V2, 
à = (ri +r — 2r3)/ v6. (2.9) 
Because it is not possible to remove the three-particle center-of-mass dependence 


for an arbitrary potential, the following approach is often followed [IsK 78]. The 
potential V (r;;) is rewritten as 


V (rij) = Vose (rij) + U (rij), (2.10) 
where 
k 5 
Ves ii U = V — Voce. (2.11) 


The Schrödinger equation is solved in terms of the oscillator potential and U is 
evaluated perturbatively in the oscillator basis. Having removed the center-of-mass 
coordinate, we are left with the following hamiltonian for the internal energy: 


2 2 
3k 3k 
Hn = (ž + ža) + (2 + =H) (2.12) 
m m 


which is just that of two independent quantum oscillators each with spring con- 
stant 3k. For later purposes, we write the number of excitation quanta for the two 
oscillators as N, and N, (Np =0, 1,2,...) and let N = N, + Nj. The angular 
momentum for the three-quark system is found in a similar manner as for the Q Q 
mesons, J = L + S. The total quark spin is S = }` s;, the orbital angular momen- 
tum is given by L= L, + L}, and the parity is P =(—)"°t. The ground-state 
wavefunction has the form 


5x2 
a ; bara 
Wena(r1, r2, r3) = (=) eP Ro- 0 ee. (2.13) 
TT 


302 Phenomenological models 


where œ? = (3km)'/*. A cautionary remark is in order. One should not misinterpret 
the use of an oscillator potential — it is not the intent to model the observed baryon 
spectrum as that of a system of quantum oscillators because such a picture would 
fail. For example, the oscillator spectrum has Ey ~ N, whereas the baryon spec- 
trum obeys the law of linear Regge trajectories (cf. Sect. XIII-2), Ey ~ N. The 
oscillator potential provides a convenient basis for structuring the calculation and 
nothing more. 


Color dependence of the interquark potential 


Short of doing a complete spectroscopic analysis, we can find experimental support 
in the following simple example for the assertion that the two-particle interquark 
potential is twice as strong in mesons as it is in baryons. 

A potential model description for the meson and baryon mass splittings p (770) — 
x (138) and A(1232) — N (939) is given by a QCD hyperfine interaction, Hnyp, akin 
to the delta function contribution in the QED hyperfine potential of Eq. (V—1.16), 


Hhyp = ka) Hij 8-8; @) (a =M, B), (2.14) 
i<j 

where the {H,; j} are constants and, assuming the color dependence is that given by 
Eqs. (2.4), (2.5), km = 1 for mesons and kg = 1/2 for baryons. We shall discuss in 
Sect. XII-2 how this effect could arise from gluon exchange. Although there is 
ordinarily dependence on quark mass in the {Hi}, it suffices to treat the {Hj} as 
an overall constant since the hadrons in this example contain only light nonstrange 
quarks. The point is then to see whether the condition ky = 2k, is in accord with 
phenomenology. Noting that for mesons the spin factors yield 
1/4 (S=1), 


— 2.15 
{ (S =0), oe 


_ 28? 3 


S1 -So 4 


whereas for baryons one has 


ne = 2.15b 
2 8 —3/4 (S=1/2), ee 


i<j 


48? —9 i 3/4 (S=3/2), 


we find after taking expectation values that 


mp-mr _ 2ku lu)? _ 2km (Volume) _ 2km l (7). J Bi 

ma—my 3kg |YgB(0)|? 3kg (Volume)ų 3kg | (r°}m l l 
The measured values (cf. Eq. (1.13)) of the proton and pion charge radii imply that 
ky/kpg = 2. This example, along with others, lends credence to the assumed color 
dependence of Eq. (2.4). 
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At this point we shall temporarily leave our discussion of the potential model to 
consider other descriptions of hadronic structure. We shall return to the potential 
model for the discussion of hadron spectroscopy in Chaps. XII-XIII. 


XI-3 Bag model 


A superconductor has an ordered quantum mechanical ground state which does 
not support a magnetic field (Meissner effect) and which is brought about by a 
condensation of dynamically paired electrons (Cooper pairs). An order parameter 
for this medium is provided by the Landau-Ginzburg wavefunction of a Cooper 
pair. Even at zero temperature, a sufficiently strong magnetic field, Ber, can induce 
a transition from the superconducting phase to the normal phase. For example, 
in tin the critical field is B.,(tin)Y 3.06 x 107? tesla, and the energy density of 
superconducting pairing (condensation energy ) is Usuper/ V = 373 J/m*. 

Chromodynamics exhibits similar behavior, and this is the basis for the bag 
model [ChJJTW 74]. The QCD ground state evidently does not support a chromo- 
electric field, and is thus analogous to the superconducting state, although a com- 
pelling description of the QCD pairing mechanism has not yet been provided. In the 
bag model, the analog of the normal conducting ground state is called the pertur- 
bative vacuum. The vacuum expectation value of the quark bilinear qq (q =u, d, s) 
plays the role of an order parameter by distinguishing between the two 
vacua, 


ocp\9|Gq|9) ocp < 9, pert(0|Gq|0) pert = 0. (3.1) 


Hadrons are represented as color-singlet ‘bags’ of perturbative vacuum occupied 
by quarks and gluons. The bag model employs as its starting point the lagrange 
density [Jo 78] 


Lyag = (Locp — B) 6(4q), (3.2) 


where the @ function (which vanishes for negative argument) defines the spatial vol- 
ume encompassed by the perturbative vacuum. B is called the bag constant, and 
is often expressed in units of (MeV)*. Physically, it represents the difference in 
energy density between the QCD and perturbative vacua. Phenomenological deter- 
minations of B yield B!/4 ~ 150 MeV, which translates to a QCD condensation 
energy of Ugcnp/V ~ 1.0 x 10°% Jm~3. Although huge on the scale of the con- 
densation energy for superconductivity, this value appears less remarkable in more 
natural units, B ~ 66 MeV fm~”. 
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Static cavity 


To obtain the equations of motion and boundary conditions for the bag model, we 
must minimize the action functional of the theory. We shall consider at first a sim- 
plified model consisting of a bag which contains only quarks of a given flavor g and 
mass m. The equations of motion that follow from the lagrangian of Eq. (3.2) are 


(i —m)q = 0, (3.3) 

within the bag volume V and 
in "yuq =4, (3.4a) 
n,” (qq) = 2B (3.4b) 


on the bag surface S, where n,, is the covariant inward normal to S. Eq. (3.3) 
describes a Dirac particle of mass m moving freely within the cavity defined by 
volume V. Since the order parameter qq vanishes at the surface of the bag, the 
linear boundary condition in Eq. (3.4a) amounts to requiring that the normal com- 
ponent of the quark vector current also vanish at the surface. Thus, quarks are 
confined within the bag. The nonlinear boundary condition represents a balance 
between the outward pressure of the quark field and the inward pressure of B. 


Spherical-cavity approximation 


In principle, the bag surface should be determined dynamically. However, the only 
manageable approximation for light-quark dynamics is one in which the shape of 
the bag is taken as spherical with some radius R. For such a static configuration, 
the nonlinear boundary condition becomes equivalent to requiring that the energy 
be minimized as a function of R. The static-cavity hamiltonian is 


H= / ax [q (ig - V)q + q Bmq + B] ; (3.5) 
v 


Observe that B plays the role of a constant energy density at all points within the 
bag. As in Eq. (1.1), the normal modes of the cavity-confined quarks and antiquarks 
provide a basis for expanding quantum fields. They are determined by solving the 
Dirac equation Eq. (3.3) in a spherical cavity. We characterize each mode in terms 
of a radial quantum number n, an orbital angular momentum quantum number £ (as 
would appear in the nonrelativistic limit), and a total angular momentum, j. Only 
j = 1/2 modes are consistent with the nonlinear boundary condition since the rigid 
spherical cavity cannot accommodate the angular variation of j > 1/2 modes. 
Such nonspherical orbitals can be treated only approximately, by implementing the 
nonlinear boundary condition as an angular average or by minimizing the solution 
with respect to the energy. In addition, since neither p;/2 modes nor radially excited 
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1/2 modes are orthogonal to a translation of the ground state, they must be admixed 
with some of the j = 3/2 modes to construct physically acceptable excitations. For 
these reasons, the bag model has been most widely applied in modeling properties 
of the ground-state hadrons rather than their excited states. 

Let us consider the s1/2 case in some detail. Even with the restriction to a single 
spin-parity state, there are still an infinity of eigenfrequencies w,. Each a, is fixed 
by the linear boundary condition, expressible as the transcendental equation 


Pn 
tan = ae (n= 1,2,...), (3.6) 
where pa = ./w; — m?R?. For zero quark mass, the lowest eigenfrequencies are 
w= 2.043, 4.611, .... For light-quark mass (mR < 1) the lowest mode frequency 
is approximated by wm, ~ 2.043 + 0.493mR, and in the limit of heavy-quark mass 
(mR > 1) becomes w, —> vm? R? + 212. The spatial wavefunction which accom- 
panies destruction of an s;/2 quark with spin alignment à and mode n is 


1 ijo(par/ R) Xi. 
Pri a) = —( . S.. We (3.7) 
V4r \-€fi(Par/R)o -FX 
while for creation of an 51/2 antiquark we have 
1 (—ieji(Par/R)o - 7X, 
pia) = —=( 7 , (3.8) 
v4r Jo(Par/R) Xa 


where € = ((@, — mR)/(@n + mR))'/”, x, is a two-component spinor, and x, = 
i02 X. The full quark field q(x), expanded in terms of the sı;2 modes, is given by 
q(x) = > N (on) [Yn e a bm + poe A R], 86D 


where 


p’ 1/2 
N (@n) = n 3.10 
On) (qo — 2, + mR) sin? —) ( ) 


is a normalization factor which is fixed by demanding that the number operator 
N; = Joi d°x qi (x)q (x) for quark flavor q have integer eigenvalues. 

By computing the expectation value of the hamiltonian in a state of N quarks 
and/or antiquarks of a given flavor, one obtains 


(H) = Now/R + 4r BR? /3 — Zo/R. (3.11) 


In the final term, Zo is a phenomenological constant that has been used in the 
literature to summarize effects having a 1/R dimension, most notably the effect 
of zero-point energies, which for an infinite-volume system would be unobserv- 
able. However, just as the Casimir effect is present for a finite-volume system 
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Yk, Ti 


(a) (b) 
Fig. XI-3 Quarks in a bag. 


with fixed boundaries, such a term must be present in the static cavity bag model 
[DeJJK 75]. Unfortunately, a precise calculation of this effect has proven to be 
rather formidable, and so one treats Zp as a phenomenological parameter. 

Upon solving the condition 0(H)/d0R =O, we obtain expressions for the bag 
radius 


1 
Rt = —(No — Zp), 3.12 
an B' 7 0) ( ) 
and the bag energy 
4 
E = 3 Ar B) No =Z ^. (3.13) 


The bag energy E is not precisely the hadron mass. Although the bag surface 
remains fixed in the cavity approximation, the quarks within move freely as inde- 
pendent particles. Thus, at one instant, the configuration of quarks might appear 
as in Fig. XI-3(a), whereas at another time, the quarks occupy the positions of 
Fig. XI—3(b). As a result, there are unavoidable fluctuations in the bag center-of- 
mass position. The bag energy is thus E = (yp? + M7’), where M is the hadron 
mass and p represents the instantaneous hadron momentum. Although the average 
momentum vanishes ((p) = 0), the fluctuations do not, ((p?) Æ 0). For all hadrons 
but the pion, it is reasonable to expand the bag energy in inverse powers of the 
hadron mass, 


E=M-+(p’)/2M+.---. (3.14) 

For the pion, one should instead expand as 
E = (pl) + Mz(pl-')/2+---. (3.15) 
One can employ the method of wave packets, to be explained in Sect. XII-1, to esti- 


mate that (|p|) ~ 2.3R7', (\p|~') ~ 0.7R for the pion bag, and (p?) ~ Nw; R? 
for a bag containing N quarks and/or antiquarks in the s;/2 mode. 
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Gluons in a bag 


Any detailed phenomenological fit of the bag model to hadrons must include the 
spin—spin interaction between quarks. One way to incorporate this effect is to posit 
that gluons, as well as quarks, can exist within a bag. With only gluons present, the 
lagrangian is taken to be [Jo 78] 


[= [5ra = B 0 (—F2,F*™ /4— B), (3.16) 
and the Euler-Lagrange equations are 
OY Fly =9 (3.17) 
in the bag volume V, and 
n" Fi, =0 (3.18a) 
Five = —4B (3.18b) 


on the bag surface S. In the limit of zero coupling, the gluon field strength becomes 
Fiy = 39u A5 — Ag The field equations in V are sourceless Maxwell equations 
with boundary conditions x - E° =Q and x x B° =0 on S, where E° and B® are 
the color electric and magnetic fields, respectively. It is convenient to work directly 
with the gluon field A“ (x), and with a gauge choice to restrict the dynamic degrees 
of freedom to the spatial components. In mode n, these obey 


[V? + (kn/R)?] AS = 0, (3.19) 
and 
V-A =0 (3.20) 


within the bag. The gluon eigenfrequencies k,, are determined by the linear bound- 
ary condition 


rx (V x A4) =0. (3.21) 


Restricting our attention to modes of positive parity, we have for the gluon field 
operator 


A%(x) = È Ne (kn) (ji (kar/R)X10 (Q)af o + h.c.), (3.22) 


where X,, is a vector spherical harmonic. The gluon normalization factor is obtained, 
analogously to N (œn) for quarks, by constraining the gluon number operator to be 
integer-valued and we find 


[Ne (kn)? = [B0 — sin(2k,)/2k,) — 2(1 + kå) sin? (kn) | R?. (3.23) 
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The quark-gluon interaction 


In the following, we shall work with the lowest positive parity mode, for which 
k, =2.744. The quark hyperfine interaction in hadron H can be computed from 
the second-order perturbation theory formula, 


Enyp = (H|Hq—2(Eo — Ho + i)! Hael H), Cet 


where the unperturbed hamiltonian Hp is given in Eq. (3.5) and Hy_, is the quark- 
gluon interaction 


Ay, = -e f dx J (x) -A%(x), (3.25) 
v 
defined in terms of the quark color current density 


1 
I(x) = 54 0yh). (3.26) 


Implicit in Eq. (3.24) is an infinite sum over all intermediate states. In practice, the 
sum can be well approximated by the lowest-energy intermediate state, and we find 
for hadron H 


Enyp = (H | Hhypl H) = ashyR', (3.27) 
where 
hy = —0.177(H| ` 0-0; F; - F; |H). (3.28) 
i<j 


The numerical factor arises from an overlap integral of quark and gluon spatial 
wavefunctions, and F;, ø; are, respectively, the color and spin operators for quark 
i. It is straightforward to demonstrate that h, =0.708, hy = — ha =h,/2, and 
ho = = hg/3: 

We have described the primary ingredients of the bag model. Fits to the masses 
of the ground-state hadrons can be accomplished within this framework, for exam- 
ple in [DeJJK 75, DoJ 80]. These reproduce many of the features of these particles, 
and we return to baryon properties in the next chapter. 


XI-4 Skyrme model 


In Chap. X, we explored the Ne — oo limit of QCD. In some respects the world 
thus defined is not unlike our own. Mesons and glueballs exist with masses which 
are O(1) as Ne —> oo. To lowest order, these particles are noninteracting because 
their coupling strength is O(N~'). What becomes of baryons in this world? It 
takes N. quarks to form a totally antisymmetric color-singlet composite, so baryon 
mass is expected to be O(N,). Note the inverse correlation between interparticle 
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coupling O(N-') and baryon mass O(N,). This is reminiscent of soliton behavior 
in theories with nonlinear dynamics. 


Sine—Gordon soliton 


An example is afforded by the Sine-Gordon model, defined in one space and one 
time dimension by the lagrangian, 

a 
G 
where @ and $ are constants. For small-amplitude field excitations, an expansion 
in powers of ọ, 


1 
Lsc = 5 (9) — By (1 — cos Bg), (4.1) 


1 a ap? 
Lsa = 5 (9)? — Sy? + F ot + 0 (B99), (4.2) 


identifies the parameter œ as the boson squared mass and £ as a coupling strength. 
For  — 0 we recover the free field theory. The Sine—Gordon lagrangian has also 
a nonperturbative static solution, 


go(x) = s tan™' (exp (vax)), (4.3a) 
with energy 


Eo = 8Va/B?. (4.3b) 


This solution is a Sine—Gordon soliton. The natural unit of length for the soli- 
ton is a~'/?, and the energy Eo diverges as the coupling is turned off (8 — 0). 
The potential energy in this theory has an infinity of equally spaced minima, with 
gy =2nn/B (n=0, +1, +2,...). As the coordinate x is varied continuously 
from —oo to +00, the soliton amplitude g(x), starting from the minimum g® =0, 
moves to the adjoining minimum yg“) = 27/8. An index AN, the winding number, 
counts the number of minima shifted. It can be expressed as the charge associated 
with a current density, 


J" = cca e" OQ, (4.4) 
20 
such that 
AN = / dx J°(x) = £ [p (+20) — y(—oo)]. (4.5) 


For ọ = po as in Eq. (4.3a) we see that AN = 1. The current density is conserved, 
Əə J” =0. Thus its charge, the winding number AN, does not change with time. 
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This is an example of a topological conservation law, whose origin lies in the non- 
trivial boundary conditions (viz. Eq. (4.5)) which a given field configuration is 
constrained to obey. 


Chiral SU(2) soliton 


Let us now seek a soliton solution for an SU (2); x SU(2)pr invariant theory in a 
spacetime of dimension four. It is natural to consider first the lowest-order chiral 
lagrangian £5, 
F2 ; 

l=" (a,Ua"U"), (4.6) 
where U is an SU(2) matrix which transforms as U — LU R`! under a chiral 
transformation for L € SU(2); and R € SU(2)r. Unfortunately, £2 cannot sup- 
port an acceptable soliton, as the soliton would have zero size and zero energy. 
To see why, recall that the Sine—Gordon soliton has a natural unit of length a~!/?. 
Suppose there is an analogous quantity, R, for the chiral soliton. Then we can write 
the radial variable as r =F R, where 7 is dimensionless. For a static solution, the 
energy becomes 


F? ' 
E= |dxu=- | dxc= A fas Tr (VU-VU'). (4.7) 


Upon expressing the integral in terms of the dimensionless variable 7, we find 
E=aR, where a is a nonnegative number. The energy is minimized at R=0 to 
the value E = 0. This trivial solution is unacceptable, and thus the model must be 
extended. 

The Skyrme model [Sk 61] employs, in addition to £2, a quartic interaction of a 
certain structure, 


F2 ľ. 1 . 72 
C= Tr (a,Ua"U") + mat [ə U UÝ, æU U Y, (4.8) 


where e (not to be confused with the electric charge!) is a dimensionless real- 
valued parameter. The above chiral lagrangian should look familiar, since it is 
part of the general fourth-order chiral lagrangian used in Chap. VII. In particu- 
lar, Eq. (4.8) is reproduced if 2L + 2L} + L4 =0, in which case (32e?) ! = (L} — 
2L' — L})/4. The comparison with the phenomenology of Chap. VII is not com- 
pletely straightforward, as the pion physics was treated to one-loop order while the 
Skyrme lagrangian is used at tree level. We note, however, that the coefficients in 
Table VII-1 give 

2L + 2L; + L} 

L5 = 2L] = L5} 


= 0.685, L} — 2L — L} = 0.0040. (4.9) 
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The latter combination, which is independent of renormalization scale, numerically 
gives e ~ 5.6. In the following development, we shall follow standard practice by 
taking the parameter e as arbitrary. 

We seek a static solution of the Skyrme model. Our strategy shall be to first 
determine the energy functional of the theory, and then minimize it. Following the 
procedure leading to Eq. (4.7), we can write the energy as 


F? : 1 i 
E= fas Tr E X;x! + TBO XiX Nen XX) ; (4.10) 
where X,, = Uð, UŤ and X, = — Xi. It is necessary that X; —> 0 as |x| —> co 


in order that the energy be finite. Thus, U must approach a constant element of 
SU (2), which we are free to choose as the identity 7. For the mesonic sector of 
the theory, the vacuum state corresponds to U(x) = / for all x. In this state, both 
the field variable X; and the energy E vanish. The form U ~ I + im - t/F,, used 
extensively in earlier chapters, corresponds to small-amplitude pionic excitations 
of the vacuum. 

To see that the Skyrme model does support a nontrivial soliton, we cast the 
energy integrals of Eq. (4.10) in terms of a natural length scale R and find 


E=aR+bR'!, (4.11) 


where a, b are nonnegative. For a,b Æ 0, the energy is minimized at nonzero R 
and nonzero E. Thus, the quartic term of Eq. (4.8) is seen to have the desired effect 
of inducing soliton stability. Moreover, for arbitrary U a lower bound on the energy 
is provided by applying the Schwartz inequality to Eq. (4.10), 


Fy 
E> fa f exit €ijkXi Xj Xx. (4.12) 
- l 


It is not hard to show that the integrand of Eq. (4.12) is proportional to the zeroth 
component of a four-vector current, 


evap 


B= Tr X,X, X3, 4.13 
24m2 p ee 


which is divergenceless, ð, B” = 0, and thus has conserved charge 
B= J d’x B? (x). (4.14) 


It turns out that the current B” can be identified as the baryon current density and 
B is the baryon number of the theory. Note that this is consistent with our prescrip- 
tion U (x)= I for the meson vacuum, where we see from Eq. (4.13) that B =0. 
Interestingly, B turns out to have an additional significance. It is the topological 
winding number for the Skyrme model, analogous to AN for the Sine—Gordon 
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model. The point is, by having associated spatial infinity with a group element of 
SU (2) to ensure that the field energy is finite, we have placed the elements of phys- 
ical space into a correspondence with the elements of the compact group SU (2). 
The parameter space of each set is S°, the unit sphere in four dimensions, and it is 
precisely the field U which implements the mapping. The mappings from S° to S? 
are known to fall into classes, each labeled by an integer-valued winding number. 
In this context, B serves to measure the number of times that the set of space points 
covers the group parameters of SU (2) for some solution U of the theory. 


The Skyrme soliton 


The Skyrme ansatz for a chiral soliton (skyrmion) has the functional form [BaNRS 
83, AANW 83] 


Uo(x) = exp [iF(r)t - x]. (4.15) 


The unknown quantity is the skyrmion profile function F (r). To specify it, we first 
determine the energy functional by substituting Up into Eq. (4.10), 


C slike f a „sin? F 
E[F] = 4x drr | —— | F*42 
0 2 r2 
1 sin? F (sin? F 2 
+573 z z +#2F ; (4.16) 


where a prime signifies differentiation with respect to the argument. For a static 
solution, the minimization of the energy generates an extremum of the action, and 
is hence equivalent to the equations of motion. The variation ôE /ôF =Q generates 
a differential equation for F, 


sin2F sin? Fsin2F 
4 r2 


a pe ” r 7 Zai 

gm F)F + ao + F% sin2F — =0, (4.17) 
as expressed in terms of a dimensionless variable 7 = r/R, with R7! = 2eF,. 
This nonlinear equation must be solved numerically, subject to certain boundary 
conditions. The condition U = / at spatial infinity implies F (oo) = 0. The bound- 
ary condition at r =0 is fixed by requiring that the soliton corresponds to baryon 
number 1. For the Skyrme ansatz, the baryon-number charge density is 

1 F'sin® F 


0 = 
B )=—- 73 ~a > (4.18) 


and corresponds to a baryon number 
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Fig. XI-4 Radial profile of the skyrmion. 


B= x [2F (0) — 2F (co) — sin 2F (0) + sin2F(o)]. (4.19) 


This leads to the choice F'(0) = zr. Although the profile F (r) cannot be determined 
analytically over its entire range, it is straightforward to show that 


mz —const.r (r— 0), 
Fir) ~ (4.20) 


const. r~? (r — oo). 


We display F(r) in Fig. XI-4. Insertion of the solution to Eq. (4.17) into the 
energy functional E[F] yields the mass M of the skyrmion, and from a numeri- 
cal integration we obtain M ~ 73 F,,/e. There is an important point to be realized 
about the skyrmion — it represents a use of chiral lagrangians outside the region 
of validity of the energy expansion. Recall that the full chiral lagrangian is writ- 
ten as a power series, L = L2 + £4 + --- in the number of derivatives. When 
matrix elements of pions are taken, terms with n derivatives produce n powers 
of the energy. Hence, at low energy, one may consistently ignore operators with 
large n, as their contributions to matrix elements are highly suppressed. However, 
in forming the skyrmion one employs only £2 and a subset of £4. The relative 
effects of the two are balanced in the minimization of the energy functional, and 
as a result both contribute equally. In an extended model containing £6, one would 
expect the import of L6 to be analogously comparable to £4, etc. Higher-derivative 
lagrangians thus will contribute to skyrmion matrix elements, and the result cannot 
be considered a controlled approximation. However, this is not sufficient cause for 
abandoning the skyrmion approach. It simply becomes a phenomenological model 
rather than a rigorous method, and thus has a status similar to potential or bag 
models. 
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Quantization and wavefunctions 


The analysis done thus far is at the classical level, and merely shows that the chiral 
soliton satisfies the equations of motion. To determine the quantum version of the 
theory, we shall follow a canonical procedure. An analogy with quantization of the 
rigid rotator may help in understanding the process. A classical solution consists of 
the rotator being at any fixed angular configuration {0, ø}. To obtain the quantum 
theory, one allows the rotator to move among these solutions, and describes its 
motion in terms of the angular coordinates and their conjugate momenta {pg, Po}. 
The quantum states are those with definite angular momentum quantum numbers 
{€, m}, and have wavefunctions given by the spherical harmonics, 


(0, plé,m) = Yem(, p). (4.21) 


The classical skyrmion solutions consist not only of Up (cf. Eq. (4.15)), but also 
of any constant SU (2) rotation thereof, Uj = AUyA~! with AeSU(2). A particu- 
larly simple approach to quantization is then to allow the soliton to rotate rigidly in 
the space of these solutions, 


U = A(t)U A7! (t), (4.22) 


where now A(t) is an arbitrary time-dependent SU (2) matrix. One proceeds to 
define a set of coordinates {az}, their conjugate momenta {z, = d£/da,}, and a 
hamiltonian constructed via Legendre transformation 


H = må — L. (4.23) 


We shall presently describe how to choose quantum numbers and determine the 
associated wavefunctions. Note that this approach is approximate in that it neglects 
the possibility of spacetime-dependent excitations such as pion emission. As such, 
it would be most appropriate for a weakly coupled theory (as occurs for Ne —> 00) 
where the soliton rotates slowly, but is only approximate in the real world. 

In general, an SU (2) matrix like A can be written in terms of three unconstrained 
parameters {6,} as 


A(t) = exp(it - 0) =I cos@ +it -Ô sind. (4.24) 
However, we can equivalently employ the four constrained parameters, 
dy = cos 0, a=6 sin 0, (4.25a) 
where 


3 
aS. (4.25b) 


k=0 
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Substitution of the rotated quantity U into Eq. (4.7) and evaluation of the spatial 
integration yields 
3 
L = —M +A Tr (AIA) = -M +24 Y a, (4.26) 
k=0 


where A = mA /3e° Fz, with 
A= J dř F’ sin? F[1+4(F? + sin? F/F°)] ~ 50.9. (4.27) 
As written in terms of the conjugate momenta 2; = 4Ad,;, the hamiltonian is 
H=M+—) > zê. (4.28) 


Adopting the canonical quantization conditions 


lak, 71] = idx, (4.29) 
we see that the canonical momenta can be expressed as differential operators, 
Tg = — ið /ðaz. Thus, the hamiltonian has the form 

1 
H = M —- —V?, 4.30 
3 4 (4.30) 


where Vj is the four-dimensional laplacian restricted to act on the three-sphere by 
the constraint of Eq. (4.25b). 
We can determine the eigenvalues and eigenvectors of H by working in analogy 
with the more familiar three-dimensional laplacian, 
ə 20 A 
v= $ =L (4.31) 


ər? rər r? 


If constrained to the unit two-sphere by the condition 5? _,x? =r? = 1, the three- 
dimensional laplacian v? reduces to —L?. As is well known, the three components 
of L are operators Lı, L2, L3 which satisfy 


[L;, Lx] = é€ ju Ly, (4.32) 


and generate rotations in the 2-3, 3-1, 1-2 planes respectively. The underlying 
symmetry group is SO (3), and the eigenfunctions are the spherical harmonics. 

The four-dimensional problem is treated by analogy. Upon adding an extra dimen- 
sion labeled by the index 0, we encounter the additional operators K,, K2, K3, 
which generate rotations in the 0-1, 0-2, 0-3 planes. The full set of six rotational 
generators can be represented as 


Ly = EijkaiTTj, Ky = doi, — Axo. (4.33) 
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The extended symmetry group is SO (4) and the commutator algebra of the rotation 
generators is 


[Lj, Le] =iejali, [Lj, Ke] =iejaKi, [Kj, Ke] =iejali. (4.34) 


The mathematics of this algebra is well known, underlying, for example the sym- 
metry of the Coulomb hamiltonian in nonrelativistic quantum mechanics. By the 
substitutions 


T = (L — K)/2, J = (L + K)/2, (4.35) 


we arrive at operators T and J, which generate commuting SU (2) algebras. We 
associate T with the isospin and J with the angular momentum. The explicit oper- 
ator representations, 


Tk = i (—€ijkaiðj + aoðk — ax 00), 
Jg = i (—€ijkai90j — ao0k + 4x00), (4.36) 


follow immediately from Eq. (4.33), and the Skyrme hamiltonian becomes 
H = M + (T + JP’) /4. (4.37) 


It follows from the commutator algebra of Eq. (4.34) that T? = J*. Thus, the quan- 
tum spectrum consists of states with equal isospin and angular momentum quantum 
numbers, T = J. This is no surprise. After all, in the Skyrme ansatz of Eq. (4.15), 
the isospin and spatial coordinates appear symmetrically, and we expect the quan- 
tum spectrum to respect this reciprocity. Our final form for the hamiltonian, 


H=M+J/21, (4.38) 
has the eigenvalue spectrum 
E=M+J(J+1)/2), (4.39) 


where in general J =0,1/2,1,3/2,.... 

By analogy with the usual spherical harmonics, the eigenfunctions of H are seen 
to be traceless symmetric polynomials in the {a,}. However, both {a,} and {—a,} 
describe the same solution U (cf. Eq. (4.22)). In the quantum theory, eigenfunc- 
tions thus fall into either of two classes, w({—az}) = + w({az}). Since fermions 
correspond to the antisymmetric choice, we select only the half-integer values in 
Eq. (4.39). In the Skyrme model, the N and A baryons will have wavefunctions 
which are respectively linear and cubic in the {az}. To construct such states, it is 
convenient to employ the differential representations of Eq. (4.36) to prove 


L3(a; + iaz) = (a; tiag), L3ao,3 = 0, 


K3(do + ia3) = +(aọ £ ia3), Kai 2 = 0. (4.40) 
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From these and Eq. (4.36), the 73 = J; = 1/2 eigenstate of a proton with spin up is 
found to be 


1 
(A| pr) = = (a, + ia). (4.41) 


The normalization of this state is obtained from the angular integral over the three- 
sphere 


1 
1 = (py! pr) = f aa, (prlA)(Al py) = = | a9; (aj +a), (4.42) 


where the angular measure is 


20 1 x 
[as = ay f do sino f dx sin’ x, (4.43) 
0 0 0 


and spherical coordinates in four dimensions are defined by 


a, = sin x sin cos@, a = sin x sinô sing, 
1 , X p 2 X P (4.44) 
a3 = sin x cos6, ao = COS X. 
The remaining nucleon states can be found by application of the spin and isospin 
lowering sperators 


J_ = [(a; — ia2)ð3 — (a3 + idp)d, + (~ao + iaz3)ð2 + (a2 + ia1)ðo]/2, 
T_ = [(aı — ia2)ð3 + (—a3 + iaọ)ðı + (ao + 1a3)02 — (a2 + iaı)ðo]/2, (4.45) 


where ðk = 0/ðaz. The T = J =3/2 A states are formed by employing analogous 
ladder operations on 


W2 
(Al A$$) = ne (a + in). (4.46) 


It is remarkable that fermions can be constructed from a chiral lagrangian which 
contains nominally bosonic degrees of freedom. However, the presence of a nonzero 
fermion quantum number can be easily verified by direct calculation. 

The wavefunctions for the eigenstates (the equivalents of Y; m(0, p) for the rigid 
rotator) are given by SU(2) rotation matrices with half-integer values. These are 
defined by the transformation properties of states under an SU (2) rotation A, 


IT, T3) =) Dig, (AJIT, To). (4.47) 
Ts 

The simplest case is then just the T = 1/2 representation, which we know is rotated 

by the matrix A, 


(4.48) 


padya o es . 
ij 


i(a; +idz) (day — iaz) 
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Comparison with Eq. (4.41) and with the results of Eq. (4.47) shows that the prop- 
erly normalized nucleon wavefunctions are 


1 
(Al No,,s,) = —(-) 91" Di (A). (4.49) 


The general case for a nonstrange baryon B of isospin T and spin S (S =T) is 
given by 


IT iT 
(Al Bins) = [| (yap « (A), (4.50) 


of which the A states are specific examples. 
Finally, the N and A masses are 


My = M +3/8A = BF, /e + © Fz /45.27, 
My = M +15/8 = 73F,/e + 5€ Fy /45.20. (4.51) 


If the measured N, A masses are used as input, one obtains e=5.44 and 
F, =65 MeV. Alternatively, from the empirical value for F, and the determination 
e ~ 5.6 from pion-pion scattering data, the model implies My ~ 1.27 GeV, My = 
1.80 GeV. In either case, agreement between theory and experiment is at about the 
30% level. The next state in the spectrum would have quantum numbers 
T = J =5/2 and is predicted by the first of the above fitting procedures to have 
mass Ms/2 = M + 35/8A ~ 1.72 GeV. There is no experimental evidence for such 
a baryon. 

Although the development of the skyrmion and its quantization have been moti- 
vated by large-N, ideas, we know of no proof that requires the skyrmion to come 
arbitrarily close to the baryons of QCD in the N. — oo limit. An oft-cited counter- 
example is the existence of a one-flavor version of QCD. Such a theory still con- 
tains baryons, such as the A++. However, it makes no sense to speak of a one-flavor 
Skyrme model, as an SU (2) group is required for the underlying soliton Up. The 
Skyrme model remains an interesting picture for nucleon structure because it is in 
many ways orthogonal to the quark model, and thus offers opportunities for new 
insights. 


XI-5 QCD sum rules 


Low-energy QCD involves a regime where the degrees of freedom are hadrons, 
and where it is futile to attempt perturbative calculations of hadronic masses and 
decay widths. Contrasted with this is the short-distance asymptotically free limit 
in which quarks and gluons are the appropriate degrees of freedom, and in which 
perturbative calculations make sense. The method of QCD sum rules represents an 
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attempt to bridge the gap between the perturbative and nonperturbative sectors by 
employing the language of dispersion relations [ShVZ 79a]. 

The existence of sum rules in QCD is quite general, and some might dispute 
the classification on these sum rules as a phenomenological method. However, in 
practice, to utilize the sum rules involves the introduction of various approxima- 
tions and heuristic procedures. Like quark model methods, these are motivated 
by physical intuition but are not always rigorous consequences of QCD. As a 
result, there remains a certain degree of uncontrollable approximation in their use. 
Nonetheless, they have been employed in a large number of applications; some 
early reviews are [ReRY 85, Na 89] and for somewhat more recent entries see 
[Ra 98, CoK 00, Sh 10]. 


Correlators 


It is convenient to approach the subject by considering the relatively simple two- 
point functions. Thus, we consider the quark bilinear, 


Ip (x) = qi) q2(x), (5.1) 


where T is a Dirac matrix, and analyze the correlator, 
if atx e!1* (0|T (Jr (x) JE (0))10). (5.2) 


Such quantities can be expressed in terms of invariant functions TIr (q?) and atten- 
dant kinematical factors, e.g., as for the correlators of pseudoscalar currents (Jp) 
and of conserved vector currents (Jy), 


Mp(q*) =i J d*x e'4* (0|T (Jp (x) Jp (0)) |0), (5.3a) 


(q"g” — q?g%”)My(q) =i J dtx &@ (OT (FEC) IEC) 0). (5.3b) 


Analogous structures occur for other currents. 

There are several means for analyzing a quantity like Tr (q°). One is to write 
a dispersion relation based on its singularity structure in the complex q? plane. 
The singularities are just those imposed by unitarity. For example, by inserting 
a complete set of intermediate states into Eq. (5.3a) for the pseudoscalar function 
TI p(q’) and invoking the constraints of Lorentz invariance and positivity of energy, 


we obtain 
[0,6] 
pp (s) 
MT p(q’) =i ds — J eee 
a s—q’—ie 


odpr (q) = Qr)? $ "(Pn — q)1(01 Je O)|ny/’, (5.4) 
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where sọ is the threshold for the physical intermediate states. Such considerations, 
together with the application of Cauchy’s theorem in the complex q? plane, imply 
a dispersion relation for Ir (q°), 


2\N oo 
Mr (q*) = a f a E Ea (5.5) 


s™ (s ~ n=0 


where the {a,} are N subtraction constants.4 One attempts to introduce a phe- 
nomenological component to the dispersion relation by expressing Im IIp(s) in 
terms of measureable quantities, e.g., with cross section-data as in the case of the 
charm contribution Cy“c to the vector current, 


1 Oete=— charm 9s 


(chm) __ 
Im Ny = eee Oe+e-— charms 


Aaa Taes pru (30) 
where e, is the c-quark electric charge and s is the squared center-of-mass energy. If 
such data are not available, another means must be found for expressing Im Ir (s) 
in the range so < $ < &. 

To approximate the low-s part of Im IIp(s), one usually employs one or more 
single-particle states. As an illustration, let us determine the contribution to M p (4°) 
of a flavored pseudoscalar meson M, which is a bound state or a narrow-width res- 
onance of the quark—antiquark pair gq. In this instance, we take the pseudoscalar 
current in the form of an axial-vector divergence, Jp > 3 AÉ, with 


ð AÉ = i (m, + m)]1Y542, 
(013, A£ (O)|M) = V2Fum?;, (5.7) 


where my and Fy are the meson’s mass and decay constant. Then Eq. (5.4) implies 


dp 
8 (qo) Pp(q7) = ony f ao umn (P — 4) 
(27)720, 
= 2Fy,m4,6(q* — m4,)0 (qo), (5.8) 
which yields pp (q?) =2F POACH — mi) for the spectral function or 
Im Mplmeson = 2 F myrs (s — m3,) (5.9) 


for the dispersion kernel. Thus, bound-state or narrow-resonance contributions give 
rise to delta-function contributions. It is not difficult to take resonant finite-width 
effects into account if desired. One or more of these single-particle contributions 
are then used to represent the low-s part of the dispersion integral. 


4 The number of subtraction constants needed depends on the behavior of Im Tp (s) in the s + oo limit, with 
Trp (q?) ~ g?N In q? requiring N subtractions. 
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Table XI-4. Local operators of low dimension. 


d: 0 4 4 6 6 6 


On: 1 mqqq G4,Ga” Glqqlq maoyy*sqGe" — faveG4,G}*G" 


Proceeding to higher s values in the dispersion integral, one enters the continuum 
region, where multiparticle intermediate states become significant and the bound- 
state (or resonance) approximation breaks down. Although, as described below, one 
ordinarily attempts to suppress the large-s part of Im ITp(s) by taking moments 
or transforms of the dispersion integral, it has been common to add to the low-s 
contribution a ‘QCD continuum’ approximation, 


Im IIp(s) ae O(s — se)Im Teont(s), (5.10) 


taken from discontinuities of QCD loop amplitudes and their O (œs) corrections. In 
Eq. (5.10), s- parameterizes the point where the continuum description begins and 
the form of Im [cont depends on the specific correlator. Experience has shown 
that this ‘parton’ description can yield reasonable agreement of scattering data 
even down into the resonance region, provided the resonances are averaged over 
(duality). 


Operator-product expansion 


A representation for correlators which is distinct from the above phenomenologi- 
cal approach can be obtained by employing an operator-product expansion for the 
product of currents, 


if as eT (Jr(x) JRO) = DCG?) On. (5.11) 


The {O,} are local operators and the {cr (q7)} are the associated Wilson coeffi- 
cients. As usual, the {O,} are organized according to their dimension and, aside 
from the unit operator /, are constructed from quark and gluon fields. Table XI—4 
exhibits the operators up to dimension six which might contribute to the correlator 
of Eq. (5.2). 

Although one may naively expect all the operators but the identity to have van- 
ishing vacuum expectation values (as is the case for normal-ordered local operators 
in perturbation theory), nonperturbative long-distance effects like those discussed 
in Sect. HI-5 generally lead to nonzero values. Most often, the operator-product 
approach contains vacuum expectation values like (EG h” ls = (2G7)o and 
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Fig. XI-5 Contributions to coefficient functions. 


(m4qq)o as universal parameters, ‘universal’ in the sense that the same few parame- 
ters appear repeatedly in applications. Calculation reveals that the quantity (“+ G’)o 
is divergent in perturbation theory, so the perturbative infinities must be subtracted 
off if one is working beyond tree-level. In principle, all the vacuum expectation 
values should be computable from lattice gauge theory once the renormalization 
prescriptions are specified. At present, the only theoretically determined combina- 
tions are the products 


(m(au + dd)jo ~ —2F2m?2, (ms5s\o ~ —F2m%,, (5.12) 


which follow from the lowest-order chiral analysis in Chap. VII. We caution that 
only the product mw is renormalization-group invariant (the gluon condensate 
(= G?}o does not, however, depend on scale). It is difficult to separate out the quark 
masses uniquely, and values for input parameters like quark masses and conden- 
sates tend to vary throughout the literature. 

Use of the short-distance expansion must be justified. We have seen in previous 
chapters how a given hadronic system is characterized in terms of the energy scales 
of confinement (A) and quark mass ({m,}). Given these, it is indeed often possible 
to choose the momentum q such that short-distance, asymptotically free kinematics 
obtain. Two situations which have received the most attention are the heavy-quark 
limit (m >> A?,q°) and the light-quark limit (q? > A? > m?). Once in the 
asymptotically free domain, it is legitimate to apply QCD perturbation theory to 
the C! (q°), with the expansion being carried out to one or more powers of as, 


Ch (°) =A), (P) + By (q)as +. (5.13) 


Rather extensive lists of Wilson coefficients already appear in the literature. 
Fig. XI-5 depicts contributions to a few of the Wilson coefficients. Denoting there 
the action of a current by the symbol ‘x’, we display in (a)—(b) the lowest-order 
and an O(a@,) correction to operator 7 and in (c)-(d), the lowest-order contributions 
to (=G?)o and to (mqq)o, respectively. 

Finally, as seen in Eq. (5.13), besides the vacuum expectation values, additional 
parameters which generally occur in the operator-product representation are the 
quark mass mą and the strong coupling œs. Since these quantities will depend on 
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the momentum q, one must interpret them as running quantities whose renormal- 
ization is to be specified. Due to asymptotic freedom, they too can be treated per- 
turbatively, e.g., as in the familiar expression Eq. (II-2.78) for the running coupling 
a; or Eq. (XIV—1.9) for the running mass m. 


Master equation 


The essence of the QCD sum rule approach is to equate the dispersion and the 
operator-product expressions to obtain a “master equation’, 


(CRM f a Im Ir (s) 


N ai 
T ds s (s—q — i€ 


j +- = O (4°) (On)o- (5.14) 


It is important to restrict use of this equation to a range of q? for which both the 
short-distance expansion and also any ‘resonance + continuum’ approximation to 
Im Ip are jointly valid. To satisfy these twin constraints, it is common practice 
not to analyze Eq. (5.14) directly, but rather first to perform certain differential 
operations leading to either moment or transform representations. The nth moment 
M}(Q3) is defined as 


(5.15) 


1%. ImMp(s) 
ds n+1? 
=o; "J (s+ Q5) 


where, in the spacelike region q? < 0, one usually works with the variable 
Q? = —q’. By taking sufficiently many derivatives, one can remove unknown sub- 
traction constants from the analysis and at the same time, enhance the contribution 
of a single-particle state at low s in the dispersion integral. 

Alternatively, one can express the dispersion integral as a kind of transform. The 
Borel transform is constructed from the moment Mr (Q?) as 


nQ"M)(Q*) — — | ds e~*/"Im Tp(s), (5.16) 


n,Q?=>œ TT Joy 


where Q?/n = t remains fixed in the limiting process and defines the transform 
variable t. To obtain the factor e~*/* in the above dispersion integral, we note 


n oO” n ver’ 
= 1+ — 
(s 4 gyt s + Q? ( s/Q ) n,Q?—>o0 


ews/t 


(5.17) 


A slightly different version of exponential transform which has appeared in the 
literature is defined analogously, 
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1 [06] 

ory) (g@’) — = J ds e° Im TIp(s), (5.18) 
n,Q?—>00 T so 

where the transform variable is now o =n/Q*. The transform method serves to 

remove the subtraction constants and to suppress the contributions from operators 

of higher dimension in the operator-product expansion. 


Examples 


Applications of the QCD sum rule approach generally proceed according to the 
following steps. 


(1) Choose the currents and write a dispersion relation for the correlator. 

(2) Model the dispersion integrals with phenomenological input, usually some 
combination of single-particle states and continuum. 

(3) Employ the operator-product expansion, including all appropriate operators up 
to some dimension at which one truncates the series. 

(4) Obtain the Wilson coefficients as an expansion in as. 

(5) Use the moment or transform technique to extract information from the master 
equation. 

(6) Vary the underlying parameters until stability of output is achieved. 


Let us consider several examples, keeping the treatment on an elementary footing 
to better emphasize the kinds of relationships which QCD sum rules entail. In fact, 
modern calculations can be quite technical, involving issues such as optimizing 
the organization of input data, inclusion of ever higher orders of both perturbation 
theory and vacuum condensates. 


(i) Rho meson decay constant f,: This was among the first applications of the QCD 
sum rule approach [ShVZ 79a]. The p isovector current hia and decay constant 


fp are 


o) _ #Yut — dyud 0 (IQ, — ct fomo 
IP =E, PRAP = GP. 619% 
The sum rule which gives fp is [CoK 00] 
2 
S elt ES ie a,(t~!) 
fo = T ls (1 e (14 T 
+ (m, + ma)t’ (qq) + e, (5.19b) 


where t is the Borel parameter, so is threshold above which TI, (s) is to be approx- 
imated via perturbation theory and ellipses represent additional condensate contri- 


butions. The variation of f, vs. 1/t turns out to display little variation in f, for, say, 
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0.6 < t~2(GeV’) < 1.3; this is the Borel window of stability. The range of values 
fo ~ 208 — 218 MeV which occur within the stability window is in accord with 
the experimental determination cited in Eq. (V—3.14) for the equivalent quantity 


§p = ftp V2. 


(ii) Mass of the charm quark: We consider the correlator for the charm-quark vector 
current J i =Cyc, 


(duq — 8uv) e (4°) =i | d‘x elf (OIT Frail 0)) |0), 


(5.20a) 
and the corresponding dispersion relation, 
ð Lf Int 
Hy” (Q?) = f e eaa B LA (5.20b) 
(—3Q ) T Jso (s + Q?) 


Following the original treatment of this system [ShVZ 79a], we work at Q? = 0 
and employ a moment analysis of the short-distance expansion containing just the 
identity and gluon contributions. The experimental input is obtained from 


Morn =] ds Oete-—> cēls) (5.21a) 


SPFL oes pa G) , 


whereas the theory side involves 
12n7e? d 
n! dq?” 


Mn = m a =: (5.21b) 
The moments M{™ can be determined in terms of an operator-product expansion, 
which is dominated by the QCD-perturbative contribution provided the value of 
n is not too large, i.e., m/n > Agcp. Perturbative contributions have long been 
studied and a library of exact results is now available (see [DeHMZ 11]): 


(1) O(a?) and O(a@!): known for all n. 
(2) O(a): known up to n = 30. 
(3) O(a): known up to n = 3. 
Below, we cite two specific O(a?) determinations of the charm quark mass 
[KuSS 07], [DeHMZ 11]. Both adopt the gluon condensate value (= G?}o =0.006+ 


0.012 GeV* (each analysis obtains only a minor effect for this term). The results 
obtained in MS renormalization are 


meme) = (1.286 + 0.013) GeV _—[KuSS 07], 
m.(m,) = (1.277 + 0.026) GeV [DeHMZ 11], (5.22) 
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whereas the value m,.(m,) = (1.275 £ 0.025) GeV is cited in [RPP 12]. The issue 
of how to assign uncertainty in QCD-sum-rule-determinations of charm mass, 
especially involving the data side of the calculation, is currently a topic of some 
interest, 


(iii) Weak decay constant of the D* meson: Consider first the axial-current diver- 
gence and correlator associated with a heavy quark Q and a light antiquark q, 
respectively of mass mg and m,, which comprise a heavy meson Mọ, 


ð AÉ = i(mg +m,)qysQ, 


Tp(q?) =i [as e'* (OT (8, A“ (x)d,A"" (0))|0). (5.23) 
A transformation with Borel variable t yields 
CO 
p(t) = i ; ds e™ Open (S, U) + Tpwr(t, ma, H) (5.24) 
(mo+mg) 


where p =Zm I1/z and Ppen(s, p) and Ipwr(t, mg, p) represent respectively, the 
perturbative and nonperturbative contributions. 

Let us now consider specifically the decay constant of the Dt, where symbol- 
ically D+ ~ (ci). The experimental value, fp = /2Fp =206.7 + 8.9 MeV is 
found from D* —> u*v, via decay formulas akin to the tree-level Eq. (VII-1.24) 
or radiatively corrected Eq. (VII-1.34) for pion leptonic decay. On the theory side, 
it is shown in [LuMS 11] that a straightforward QCD sum rule approach yields 


Seft(T) 


—M? st 
fè Mhe Mp =| ds e” Ppert (5, U) + Tpwr(t, mgo, u), (5.25) 


(me+my)? 


where the condensate values adopted are 


(Gq) (uw) = —(267 + 17 MeV)’, (=e) = (0.024 + 0.012) GeV*, (5.26) 


with u =2 GeV being the MS renormalization scale. The most novel part of the 
expression in Eq. (5.25) is the presence of an ‘effective continuum threshold’ 
Sefe(T). The t-dependence of se supplants the traditional form of Eq. (5.10) in 
which a constant cut-off se is used to describe the onset of continuum contribu- 
tions. We leave a detailed discussion of the effective threshold to [LuMS 11] and 
simply state the final result, 


ae = (206.2 Ba 7.3 (OPE) + 5. Lies) MeV, 
which is consistent with the experimental finding shown above. 


(iv) Nucleon mass: It is not necessary to restrict oneself to mesonic currents as in 
Eq. (5.1). Here, we consider a current ny (and its correlator), which carries the 
quantum numbers of the nucleon, 
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nN = ciju’ Cy”ui ysyd", 
M(q*) = M (q) +4 =i J d*x e'4* (O|T (ny (Tn (0)) 10), 6.27) 


where C is the charge-conjugation matrix. The simplest approximation to the dis- 
persion integral comes from the nucleon pole, 


(5.28) 


where the coupling 4% is proportional to the ‘nucleon decay constant’, i.e., the 
probability of finding all three quarks within the nucleon at one point. Upon making 
a simple approximation to the operator-product expansion, 
2 q? 2\ i> 2 qt 2 
m (P) = -za C) Ga), m) San) 629 
and employing a Borel transform, one obtains an amusing relation between nucleon 
mass and quark condensate [Io 81], 


My = (-87°(4)0) +... = 1 GeV, (5.30) 


and implies the vanishing of the former with the latter. However, it should be 
realized that this result is subject to important corrections in a more complete 
treatment. 

Each of the above examples has involved two-point functions. It is possible to 
apply the method to three-point functions as well, where one can obtain coupling- 
constant relations. The underlying principles are the same, but some technical 
details are modified owing to the larger number of variables, e.g., one encounters 
double-moments or double-transforms. 

QCD sum rules work best when there is a reliable way to estimate the dispersion 
integral, most often with ground-state single-particle contributions. However, the 
method has its limitations. It is not at its best in probing radial excitations since 
their dispersion effects are generally rather small. Even having a good approxi- 
mation to the dispersion integral is not sufficient to guarantee success. For exam- 
ple, the method has trouble in dealing with high-spin (J > 3) mesons because, 
even with dispersion integrals which are dominated by ground-state contributions, 
power corrections in the operator-product expansion become unmanageable. 


Problems 


(1) Velocity in potential models 
Truly nonrelativistic systems have excitation energies small compared to the 
masses of their constituents. However, fitting the observed spectrum of light 
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(2) 


(3) 
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hadrons requires excitation energies comparable to or larger than the con- 
stituent masses. 

Assuming nonrelativistic kinematics, consider a particle of reduced mass m 
moving in a harmonic-oscillator potential of angular frequency w. Expressing 
œw in terms of the energy splitting Eı — Eo between the first-excited state and 
the ground state, use the virial theorem to determine the ‘rms’ velocities of the 
ground state (v\) and of the first-excited state (v(?.) in terms of E; — Eo. 
Compute the magnitude of v /c and v% /c using as inputs (i) my, — m, X 
500 MeV for light hadrons and (ii) mys) — my;y ~ 590 MeV for charmed 
quarks. 

Your results should demonstrate that the kinematics of quarks in light hadrons 
is not truly nonrelativistic. However, one tends to overlook this flaw given the 
potential model’s overall utility. 

Nucleon mass and the Skyrme model 

(a) Use the Skyrme ansatz of Eq. (4.15) to derive the expression Eq. (4.16) for 
the nucleon energy E[F]. 

(b) Using the simple trial function F(r) =x exp(—r/R), scale out the range 
factor R to put E[F] in the form of Eq. (4.11), where a ~ 30.8F? and 
b ~ 44.7/e? are determined via numerical integration. 

(c) Minimize E[F] by varying R and compare your result with the value 
73 Fx /e determined with a more complex variational function. 

(d) Using the numerical value of the nucleon mass, determine e and compare 
with the value 

1 4 1 


32e F2 (nF? 


expected from chiral-scaling arguments. 
A ‘QCD sum rule’ for the isotropic harmonic oscillator 
Consider three-dimensional isotropic harmonic motion with angular frequency 
w of a particle of mass m. 
(a) Using ordinary quantum mechanics or more formal path-integral methods, 
determine the exact Green’s function G (t) for propagation from time t = 0 
to imaginary time t = — it at fixed spatial point x = 0. G(r) is the analog 
of the ‘correlator’ for our quantum mechanical system. 
From the representation G(t) = (0, —it|0, 0), use completeness to express 
G(T) in terms of the S-wave radial wavefunctions {R,,(0)} evaluated at 
the origin and the energy eigenvalues {£,,}. What values of n contribute? 
This representation is the analog of the dispersion relation expression for a 
correlator in which one takes into account an infinity of resonances. 


(b 


wm 
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(c) Plot the negative logarithmic derivative —d[In G(t)]/dt for the range 0 < 


(d 


) 


æt < 5 and interpret the large wt behavior in terms of your result in 
part (b). 

Obtain the first three terms in a power series for —d[In G(t)]/dt, expanded 
about t =0. This is the analog of the series of operator-product ‘power 
corrections’ to —d[In G(t)]/dt. Assume, as is the case in QCD, that you 
know only a limited number of terms in this series, first two terms and then 
four terms. Is there a common range of wt for which (i) your truncated 
series reasonably approximates the exact behavior, and (ii) the approxima- 
tion for keeping just the lowest bound state in part (b) is likewise reason- 
able? It is this compromise between competing demands of the resonance 
and operator-product approximations which must be satisfied in sucessfully 
applying the QCD sum rules to physical systems. 


XII 


Baryon properties 


An important sector of hadron phenomenology is associated with the electroweak 
interactions. Baryons provide a particularly rich source of information, with data on 
vector and axial-vector couplings, magnetic moments, and charge radii. In 
Sect. XII-1, we describe the procedure for computing matrix elements in the con- 
stituent quark model, and then turn to a variety of applications in the succeeding 
sections. ! 


XII-1 Matrix-element computations 


Much of the application of the quark model to physical systems involves the calcu- 
lation of matrix elements. The subject divides naturally into two parts. On the one 
hand, many quantities of interest follow from just the flavor and spin content of the 
hadronic states. On the other, it is often necessary to have a detailed picture of 
the quark spatial wavefunction. 


Flavor and spin matrix elements 


For the first of these, the quark model is particularly appealing because of the intu- 
itive physical picture which it provides. For example, consider the quark content of 
the proton state vector, which we reproduce here from Table XI-2, 


1 tat o Eytt 
|p) = gg ah = U;44 juz] |0). (1.1) 


The first two quarks form a spin-zero, isospin-zero pair with the net spin and 
isospin of the proton being given by the final quark. The prefactor of 1/v/18 
ensures that the state vector has unit normalization. Calculation reveals that one- 
third of the magnitude of this normalization factor comes from the u;u,d, term 


1 The reader can also consult the Ne — œ studies as described in [DaJM 94, Je 98]. 
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Table XII-1. Some baryon octet expectation values. 


p n A xt x => g0 om 
(Q) 1 0 0 1 0 =j 0 = 
(Qo;) j —2/3 —1/3 1 1/3% —1/3 —2/3 —1/3 
(A30;) 5/3 —5/3 2//6 43 0 —4/3 —1/3 1/3 


“The off-diagonal transition X} —> A has |(Qo;)| = 1/v3. 


and two-thirds from the u,u;d, term, i.e. one concludes that ‘the proton is twice 
as likely to be found in the configuration with the u-quark spins aligned than 
anti-aligned’, 


= 2/3 (uyusd,), 
1/3 (uzu, dz). 


The ‘six parts in eighteen’ of the u}u jd, configuration arises entirely from the six 
ways that color can be distributed among three distinct entities. The configuration 
uxUyd, is twice as large due to the presence of two u} states. Similar kinds of 
inferences can be drawn for the remaining baryon state vectors in Table XI-2. 

We can proceed analogously in deriving and interpreting various matrix-element 
relationships. It is instructive to work at first in the limit of SU(3) invariance 
because more predictions become available. The effect of symmetry breaking is 
addressed in Sect. XII—2. Let us consider matrix elements, taken between mem- 
bers of the spin one-half baryon octet, of the operators 


Prob. (1.2) 


squared charge-radius : fas rew' Ow x (Q), 
axial-vector current : fas Wi ysysasw x (A302), 


magnetic moment : fas a x viaQw); «x (Qo). (1.3) 


Along with the definition of each operator is indicated the flavor—spin attribute 
of an individual quark which is being averaged over. For example, a magnetic 
moment is sensitive to the combination Qo, of each quark within the baryon. 
Matrix elements will then be products of such averages times quark wavefunction 
overlap integrals. The flavor—spin averages for the baryon octet are displayed in 
Table XII-1. 

To see how these values are arrived at, let us compute the value 5/3 obtained for 
the proton axial-vector matrix element. For the configuration u,u;d), which occurs 
with a probability of 2/3, the average value of 430, equals (1 + 1 + 1) x 2/3 = 2, 


332 Baryon properties 


whereas for the configuration uu ,d, one finds (1— 1—1) x 1/3 = —1/3. Together 
they sum to the value 5/3. 


Overlaps of spatial wavefunctions 


The spatial description of quark wavefunctions is less well understood than the 
spin/flavor aspect of the phenomenology.” The most extensive studies of the spa- 
tial wavefunctions are associated with matrix elements of currents. Because these 
are bilinear in quark fields and because of the wavefunction normalization con- 
dition, the magnitudes of these amplitudes are constrained to be nearly correct. 
Dimensional matrix elements are primarily governed by the radius of the bound 
state. As long as the proper value is fed into the calculation, the scale should come 
out right. 

As noted in Sect. XI-1, a relativistic quark moving in a spin-independent central 
potential has a ground-state wavefunction of the form 


=( bur) x je (1.4) 
aid L(r)o -x x 


where u, £ signify ‘upper’ and ‘lower’ components. For the bag model, these radial 


(x) 


wavefunctions are just spherical Bessel functions. This form also appears in some 
relativistic harmonic oscillator models, which use a central potential. To charac- 
terize different types of relativistic behavior, it is worthwhile to express matrix 
elements in terms of u and £ without specifying them in detail. The normalization 
condition for the spatial wavefunction is then 


fas w(x) W(x) = fas (u(r) + 27 (r)) = 1. (1.5) 


In the nonrelativistic regime, the lower component vanishes (£ = 0). 
Let us consider the size of the lower components which occur in various 
approaches. In the bag model one obtains for massless quarks the integrated value 


fas L (r) x 0.26. (1.6) 


Relativistic effects are often included in potential models by working in momentum 
space and employing the spinor appropriate for a quark g in momentum 
eigenstate p, 


2 Even the experimental value of the proton charge radius rg is in question. The historical approach, to 
measure the differential cross section in elastic electron—proton scattering at low Q?, gives rg = 0.879(8) fm 
and rg = 0.875(11) fm [Zh et al. 11] in recent experiments. By contrast, measurement of the 
2S ie — 2P3 5" energy difference in muonic hydrogen [An et al. 13] yields (using a consistent definition 
of charge radius) rg = 0.84087(39) fm, which is at 7o variance relative to the scattering value. 
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X 
u(p) = yE +m o-p : (1.7) 
E+m, š 
In this case the relevant prescription is 
p? 
fas eash, (1.8) 
2E(E + mq) 


where the averaging is taken over the momentum-space wavefunction of the par- 
ticular model. Using the uncertainty principle relation of Eq. (XI-1.14) to estimate 
(p°), we find typical values 


2 

__P\ 913 + 0.20 (1.9) 
2E(E +m) 

for a confinement scale of 1 fm. Larger effects are found in the harmonic-oscillator 

model if one uses the value œ? = 0.17 GeV? (see Fig. XI-2). Generally, the lower 

component is found to be significant but not dominant in quark wavefunctions. 


Connection to momentum eigenstates 


In all cases except for the nonrelativistic version of the harmonic oscillator model, 
one cannot explicitly separate out the center-of-mass motion. The result of a quark 
model description of a bound state is a configuration localized in coordinate space, 
i.e., a position eigenstate. However, the analysis of scattering and decay deals with 
the plane waves of momentum eigenstates. 

The basic assumption made in all quark models is that the bound state with a 
given set of quantum numbers is related to only those momentum eigenstates of 
the same type. If we denote |H(x)) as a unit-normalized hadron state centered 
about point x and | H(p)) as a plane-wave state, then we have 


|H(x)) = J d? p o(p)e®™ |H (p)). (1.10) 


We shall give a prescription for obtaining a functional form for g(p) shortly. Let 
us normalize the plane-wave states for both mesons and baryons as 


(HPH (p)) = 2a (277)*5 (p — p). (1.11) 


The constraint of unit normalization then implies 


J d’? p 2wp(27) lpp)? = 1. (1.12) 
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We can employ the above wavepacket description to derive a general procedure 
within the quark model for calculating matrix elements [DoJ 80]. Many matrix 
elements of interest involve a local operator O evaluated between initial and final 
single-hadron states. Let us characterize the magnitude of the matrix element in 
terms of a constant g. Then, for baryons in the momentum basis, the spatial depen- 
dence is given by 


(B'(p') |O(x)| B(p)) = g TPT oup) iPP, (1.13) 


where Ig is a Dirac matrix appropriate for the operator O. By comparison, one 
obtains in any bound-state quark model (QM) calculation a spatial dependence 
whose specific form is model-dependent, 


em (B’|O(x)| B)am = f(x). (1.14) 


Hereafter, let us center all quark model states at the origin. The method of 
wavepackets then implies 


om (B" | | d’x O(x)|B)om = g l ax / d’p'd’p ¢ Np) 
x TPT oup) 


=g / d’ p (22) |p) UMT oup). (1.15) 


For sufficiently heavy bound states the fluctuation in squared momentum (p?) is 
small, and one may expand about |p| = 0, 


Tp) ou (p) = WO) ou(0) + O ((p*)/m5) - (1.16) 
A common approach consists of keeping only the leading term to obtain 


E 
2mpg 


u)r ou(0) = out f a's O(x) |B)om- (1.17) 


It is interesting to note that this relation, often thought of as fundamental, is in fact 
only an approximation. 

As an example, let us perform the complete quark model procedure for the 
neutron—proton axial-vector current matrix element. We begin by defining as usual 


(P(Po, 52) |A Œ) | n(p1, 51)) = BAU (Po, $2) Vu ysu (Pi, s1) PPO Hoe, 
(1.18) 


For spin-up nucleons the choice u = 3 gives 


uO, t)ysysu(0, 1) = 2my, (1.19) 
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yielding for Eq. (1.18) the basic formula, 


8a = om (Ph J d°x W(x)ysysd (x) [n)a (1.20) 
The field operator for any quark q is expanded as in Eq. (XI-1.1), 
qalx) = J) | Vase Gna) + nse" @| (1.21) 


Substituting, we have 


ga = om(Pil [as Wo. (2) ¥3¥5Wo,s(x) u} (s)de(s’)|n4)em, (1.22) 


where only the n = 0 ground-state mode contributes. At this stage, one can factor- 
ize the spin and space components by using the general ground-state wavefunction 
of Eq. (1.4). This leads to 


fas VosV3Vs¥o,s' = fas xi u03 — L0 +B) xy 
=o" l dx (u? = 50) (1.23) 
and thus 
ë= fas G — 5e) am (P+ lu" (s, eo)o$"'d(s", o| n)an: (1.24) 


Finally, upon dealing with the spin dependence in Eq. (1.24), we obtain 


5 1 5 4 
ga = > f a's (« = 56) s; (1 — = fax é). (1.25) 


Any nonrelativistic quark model, having zero lower components, would simply 
yield ga = 5/3. If one desires to make relativistic corrections to such a model, 
the result can be inferred from the above general formula with the appropriate 
substitution of Eq. (1.8). Clearly, the procedure just given can be extended to matrix 
elements of any physical observable. 

The wavepacket formalism also allows for the estimation of the ‘center-of-mass’ 
correction. This arises from the (p?) modifications to Eq. (1.16). For the axial cur- 
rent, the zero-momentum relation in Eq. (1.19) is extended for nonzero momentum 
to 


u2(p, T)yvsysui(p, t) -1 


p? 1 3m 3m, 
2E 3mım2 


ale pe T ie O(p*), (1.26 
tpt amt) + P^, (1.26) 
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where an average over the direction of p has been performed. This expression 
generalizes Eq. (1.25) to 


(Pap /1 3 Mp 3m, 5 Í 3 z l5 
ra = d ee oa 1.27 
sa| 3m,mp a ama | 8m, 3 sae 3 ( ) 


where (P°)np ~ 0.5 GeV? is a typical bag model value. 

It is possible to argue that in the transition from the current quarks of the QCD 
lagrangian to the constituent quarks of the quark model, the couplings to currents 
should be modified. For example, one might suspect that the coupling of a con- 
stituent quark to the axial current occurs not with strength unity, but with a strength 
ae such that the nonrelativistic expectation is not g; = 5/3 but rather g; =5 go /3. 
The choice a ~ 3/4 would then yield the experimental value. This is not unrea- 
sonable but, if fully adopted, leads to a lack of predictivity. In such a picture, not 
only can the magnetic moments and weak couplings be renormalized, but also the 
spin and flavor structures. That is, in the ‘dressing’ process which a constituent 
quark undergoes, there could be ‘sea’ quarks, such that the constituent u quark 
could have gluonic, d-quark, or s-quark content. Likewise, some of the spin of 
the constituent quarks could be carried by gluons. One is then at a loss to know 
how to calculate matrix elements of currents. In practice, however, the naive quark 
model, with no rescaling of ga or of the magnetic moment, does a reasonable job of 
describing current matrix elements. It is then of interest to study both the structure 
and limitations of this simple approach. 


Calculations in the Skyrme model 


There are several differences between taking matrix elements in the quark model 
and in the Skyrme model [Sk 62]. To begin, in the quark model a current is 
expressed in terms of a bilinear covariant in the quark fields (cf. Eq. (1.3)), whereas 
in the Skyrme model the representation of a current is rather different. As an 
example, application of either Noether’s theorem or the external source method 
of Sect. IV—6 identifies the SU (2) vector and axial-vector currents to be 


e i t t 
(4) = ET (U Ut + 3,0" U) 


a/u 
— — [Tr ([r*, 3U Ut] [a U Ut, 3U U*)) 
e 
+Tr ([t", a0 U] [ə Ut U, Ut U])], (1.28) 


where U = A(t)U A7'(t) is the quantized skyrmion form and A(t) is an SU (2) 
matrix. We shall neglect derivatives of A(t), as the quantization hypothesis corre- 
sponds to slow rotations. This leads to a result similar in form to Eq. (1.28), but 
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with U —> Uo and t“ —> A7!(t)t“ A(t). The answer may be simplified by use of 
the explicit form of Up appearing in Eq. (XI-4.15). 

Let us use Eq. (1.28) to compute the spatial integral of the axial current. After 
some algebra, we obtain a product of spatial and internal factors, 


[as (Ja); = -Gs Tr (t Ar A’), 


in2F 4sin2F 
o=-5 f arr |F p sin A sin (F? 


3e? a r 
8sin? F , 4sin? F sin2F 
$F 4 |, (1.29) 
F r 


where a is the isospin component and j is the Lorentz component. This is now 
suitable for taking matrix elements, such as 


(prl J dx (Ja) pr) = J dx / dQ3 (pl A) (Ja); (Alpy) 


(4) 
= Gs f a D De 2 (A) Te Ar ADE? (A), 
32 
(1.30) 
where we have used the completeness relation of Eq. (XI-4.42). Upon ae 
the trace in Eq. (1.30) as a rotation matrix, Tr (t* At! AT!)/2 = DP , we can 


determine the group integration in Eq. (1.30) in terms of SU (2) Clebsch-Gordan 
coefficients, 


1 , 27? n ” 
(Tx T’) (T) — ¢_\2(T'-T +n) PTI" AT'TT 
J am DE ADPADPA) =O ee chr. 


(1.31) 


Alternatively, one can work directly with the collective coordinates, e.g., with the 
aid of Eqs. (XI-4.41-4.44) we obtain fora = j = 3 


a dQ3 (a1 — ian) (a? + a} — a? — a3)(a + iaz) = -Gs. a39 
T 3 

Before one can infer a Skyrme model prediction for g4 from this calculation, 
there is a subtlety not present for the quark calculation, which must be addressed. 
Due to the original chirally invariant lagrangian, the Skyrme model is unique among 
phenomenological models in being completely compatible with the constraints of 
chiral symmetry. As a consequence, the near-static axial-vector matrix element is 
constrained to obey 


q; (P(p')|(Ja);|P(p)) = 0, q=p-p (1.33a) 
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and hence must be of the form [AdNW 83], 


(p(p’)|(Ja)3|p(p)) = 2mpga (èn = ua) (ox). (1.33b) 
The term containing |q|~? arises from the pion pole, as will be discussed in 
Sect. XII-3 in connection with the Goldberger—Treiman relation. An angular aver- 
age of Eq. (1.33b) then yields 2g4/3, which from comparison with Eq. (1.32) 
implies ga = Gs. Thus in the Skyrme model, the axial-vector coupling constant 
equals the radial integral in Eq. (1.29) which defines Gs. Use of the profile given 
in Sect. XI— leads to the prediction gy = 0.61, which is about only one-half 
the experimental value and constitutes a well-known deficiency of skyrmion phe- 
nomenology. Presumably, consideration of a more general chiral lagrangian could 
modify this result by including higher derivative components in the weak current. 
Pions may be added to the Skyrme description through introduction of the matrix 
é described in App. B— [Sc 84], 


U=EAQUA THE, E= explit - x /(2Fz)]. (1.34) 


If currents are formed using this ansatz, some terms occur without derivatives on 
the pion field, while others contain one or more factors of 9” 7 . Since 0“z gives rise 
to a momentum factor g* when matrix elements are taken and soft-pion theorems 
deal with the limit g* — 0, the lowest-order soft-pion contribution will consist 
of keeping only terms without derivatives. Thus in the process v, +N > N + 
x + u the final-state pion is produced by a hadronic weak current and the soft-pion 
theorem relates the N — Nz matrix element to the N — N current form factors. 
Expanding the currents to first order in the pion field yields 


ý i Fr a 4—1 t t 
(4), = [Tr (1A! (ap U$ Uo + 3p Uo UÈ) A) 
iz? 


2Fy 


Tr (ce 7 Aq! (aui Uo F uUo Uj) A) + | , (1.35) 


where for notational simplicity we have displayed only the first term in the current. 
Note the sign flip in the second line. This form is in accord with the soft-pion 
theorem (see App. B-3) 


: N e Db k ea i Itat b " 
Jim (N'a (e)l (J) INO) = — FN) [o O IN) 
cube a) c 
=- (NII) IN@), 030 


where the current commutation rules of App. B-3 have been used. 
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The static properties of baryons can be determined from their coupling to the weak 
and electromagnetic currents. In this section, we shall describe these features in 
terms of the quark model. 


Magnetic moments 


The generic quark model assumption for the magnetic moment is that the individual 
quarks couple independently to a photon probe. For ground-state baryons where all 
the quarks move in relative S waves, the magnetic moment is thus the vector sum 
of the quark magnetic moments, 


3 
Hbaryon = > Mj Oj, (2.1) 
i=l 


where o; is the Pauli matrix representing the spin state of the ith quark and wu; is the 
magnitude of the quark magnetic moment.’ Since the light hadrons contain three 
quark flavors, the most general fitting procedure to the moments of the baryon octet 
will involve the magnetic moments Mu, Hd, Hs- 

It is straightforward to infer baryon magnetic-moment predictions in the quark 
model directly from the state vectors of Table XI—2. For example, we have seen that 
the proton occurs in the two configurations u,u;d) and u,u,d with probabilities 
2/3 and 1/3, respectively. This can be used to carry out the construction defined 
by Eq. (2.1) as follows: 


2 1 
Mp = 3 M(usurdy) + 3 [L(uru dy) 


1 
Bld: (2.2) 


2 1 4 
= zeuu) + (dy) + ze) + u(u;) + w(dy)] = glu 3 


and similarly for the other baryons. Experimental and quark model values are dis- 
played in Table XH-2. 

It is of interest to see how well the assumption of SU(3) symmetry fares. In 
the limit of degenerate quark mass (denoted by a superbar), the quark magnetic 
moments are proportional to the quark electric charges, 


1 
Hd = Ms = =z bu (SU (3) limit), (2.3a) 


3 When referring to the ‘magnetic moment’ of a quantum system, one means the maximum component 
along a quantization axis (often chosen as the 3-axis). Thus, the magnetic moment is sensitive to the third 
component of quark spin as weighted by the quark magnetic moment. 
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Table XII-2. Baryon magnetic moments. 


Mode Experiment“ Quark model Fit A? Fit BS 
up 2.792847386(63) “Hue Hd 2.79 2.79 
is —1.91304275(45) Aa te —1.86 =1,51 
MA —0.613(4) Lbs —0.93 —0.61 
byt 2.458(10) nan 2.79 2.67 
Iuyoal 1.61(8) ar pal 1.61 1.63 
jise —1.160(25) thd- Hs —0.93 —1.09 
[x0 —1.250(14) Aus Hu —1.86 —144 
us- —0.651(3) A Hd —0.93 —0.49 


“Expressed in units of the nucleon magneton wy = eħ/2Mp. 


>SU() symmetric fit. 
“Ly, Hd, Hs taken as independent parameters. 


while isospin symmetry would imply 
1 
la = S 5 Mu (SU (2) limit). (2.3b) 


If we determine the one free parameter by fitting to the very precisely known proton 
moment, we obtain the SU (3) symmetric Fit A shown in Table XII-2. More gener- 
ally, allowing Hu, Ha, Hs to differ and determining them from the proton, neutron, 
and lambda moments yields 


Ly = 1.85 Ln, Ha = —0.972 uy, us = —0.613 uy, (2.4) 


and leads to the improved (but not perfect) agreement of Fit B in Table XII-2. 
We see from Eq. (2.4) that the main effect of SU(3) breaking is to substantially 
reduce the magnetic moment of the strange quark relative to that of the down quark. 
The deviation of j1¢//, from the isospin expectation of fg /U, = —1/2 is smaller 
and perhaps not significant. Observe that the famous prediction of the SU (2) limit, 
Un/Hp = —2/3, is very nearly satisfied. 

The magnetic moment as derived from the multipole expansion of the electric 
current is defined by 


2 


It follows from this expression that the contribution of a nonrelativistic quark ‘q’ 
to the hadronic magnetic moment is just the Dirac result, 


_ 2 
2M, 


1 3 
— J d’x r x Jen): (2.5) 


lq (2.6) 
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where Mọ is the quark’s constituent mass and Q is its charge. We can use this 
together with Eq. (2.4) to determine the constituent quark masses, with the result 


M, ~ Ma ~320MeV, M, ~510 MeV. (2.7) 


As we shall see in Sect. XIII-1, these masses are comparable to those extracted 
from mass spectra of the light hadrons. 

One can also construct models involving relativistic quarks. For these, the 
magnetic-moment contribution of an individual quark becomes 


2 


u= La J d°x r u(r) €(r). (2.8) 


Note the absence of an explicit dependence on quark mass. This is compensated by 
some appropriate dimensional quantity. The inverse radius R~! plays this role in 
the bag model, and other determinations of R allow for a prediction of the hadronic 
magnetic moment. For example, the bag model defined by taking zero quark mass 
(corresponding to the ultrarelativistic limit) and R = 1 fm yields the value u, ~ 
2.5 in a treatment which takes center-of-mass corrections into account [DoJ 80]. 
Although this specific value is somewhat too small, it is fair to say that quark 
models give a reasonable first approximation to baryon magnetic moments. 


Semileptonic matrix elements 


The most general form for the hadronic weak current in the transition B, > Bolv¢ 
is 


(Bo(p2)| J" |B, (pi) 


- an, iha ,, BAÐ 
= u(p2) [ra Yu mm a a oe m 
J iga(q°) ; g3(q°) 
ta y a f 2.9 
+81 (q4 )Yuys PE ys + nema u(pı) (2.9) 


where the {f;} and {g;} form factors correspond respectively to the vector and 
axial-vector current matrix elements, and q = pı — p2 is the momentum trans- 
fer.* The form factors are all functions of q? and the phases are chosen so that each 
form factor is real-valued if time-reversal invariance is respected. In practice, the 
form factors accompanying the two terms with the kinematical factor q„ are dif- 
ficult to observe because each such contribution is multiplied by a (small) lepton 
mass upon being contracted with a leptonic weak current. Thus, we shall drop these 
until Sect. XII—4. 


4 Given the context of application, there should be no confusion between the QCD strong coupling constant g3 
and the axial-vector form factor g3 (q?). 
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As regards the remaining form factors, we have already presented the ingredients 
for performing a quark model analysis (see also [DoGH 865]). Using the n > p 
transition as a prototype, we have 


ki = (pl J dx uy°d |n+) = fas (Uuta + lula) =1, (2.10a) 


apo AD 
Lie = (ml f dx 5 tex @ydh my) 
= 5 f Ex ruta + uate) n ( : + : ) l (2.10b) 
3 2\2M, 2Mı 
“p a 5 f3 1 
& = (nil fa x uy3ysd |ny) = + fa X (Uyug — zluta), (2.10c) 
np np np 
PAT + (= = im) a = (pl — if ax zy ysd |n+) 
= 5 f ax cust — u,lqa) = f ( E J (2.10d) 
3 2\2M, 2M, 


In each case, we first give the defining relation, then the general Dirac wavefunc- 
tion (cf. Eq. (1.4)) and, finally, the nonrelativistic quark model limit. The vanish- 
ing of g,” in the limit of exact isospin symmetry is a consequence of G-parity 
(cf. Sect. V-3)). 

Predictions for the other baryonic transitions are governed by SU (3) invariance, 
amended by small departures from SU (3) invariance as suggested by the quark 
model, i.e., s —> u transitions are similar to those of d —> u as given above, but with 
the down-quark mass and wavefunction replaced by those of the strange quark. 
SU (3) breaking in the form factors arises from this difference in the wavefunction. 
As a quark gets heavier, its wavefunction is more concentrated near the origin 
and the lower component becomes less important. The form factors of the matrix 
element (Bp|J”|Ba) evaluated in the SU (3) limit at q? = 0 give for the vector 
current, 


fi (0) = ifabes f2(0) = i fabe f F daped, 
1 F 3 (2.11la) 
=>, D n7 1), = — z Mn, 
f 5 1 +u ) 5H 
with f/d = 0.29, and for the axial-vector current, 
8100) = i fabe F + davc D, (2.11b) 


np 


with F + D = g} = ga = 1.27. In the above, the indices a, b, c = 1, ..., 8 label 
the SU (3) of flavor, with c = (1 + i2) for AS = 0 and c = 4 + i5 for AS = 1. 
There is no SU (3) parameterization for the g2 form factor because it vanishes in the 
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SU (3) limit. An important result specific to the quark model is D/(D + F) = 3/5 
for the SU (3) structure of the axial-current {g,} form factors. 

SU (3) breaking in the { fı} form factors is required by the Ademollo—Gatto the- 
orem to occur only beginning at second order (see Sect. VII-1). In practice, the 
quark model prediction for SU (3) breaking yields an extremely small effect. This 
is not true for the {f2} form factors of weak magnetism, where inclusion of the 
strange-quark mass lowers all s —> u transitions by 20% compared to the d > u 
transition. The wavefunction overlaps in g; lead to a slight increase in the strength 
of the s — u transition compared to d — u because of the reduced lower compo- 
nent of the s quark. For g2, a nonzero but highly model-dependent value is gener- 
ated, typically of order |g2/g)| = 0.3. 


XII-3 Symmetry properties and masses 


In our discussion of baryon properties, we have first discussed quark models 
because they are generally simple and have predictive power. However, effective 
field theory methods are also useful when applied to the study of baryons.> We 
shall combine the two descriptions in this section. 


Effective lagrangians for baryons 


We begin by writing effective lagrangians which include baryon fields, using the 
procedure described in App. B-4. The lowest-order SU (2)-invariant lagrangian 
describing the nucleon and its pionic couplings has the form 


Ly =Ñ (iP — ga Ays — mo) N 
Zo - . Z\ - 
-— SN a +E) N — an Tr (MU + Uñ), 
Di = +iVy, €=explit-n/2F,)], U=EéEE, 


= Ls ; = bp 7 

V, = z (Eta E + €0,&'), Ay = = (€'0,€ — €0,&"), (3.1) 
where N = (i ) is the nucleon field, m is the mass matrix for current quarks (with 
mM, = mq =m), Zo and Z; are arbitrary constants which parameterize terms pro- 
portional to the quark mass matrix, and the constant g4 is the nucleon axial-vector 
coupling constant gq ~ 1.27 (cf. Prob. XII-1). The mass parameter mo represents 
the nucleon mass in the SU (2) chiral limit. 


5 There is also an effective field theory treatment of the few nucleon case [We 90, Va 08, EpM 12] which helps 
understand nuclei in a systematic manner. 
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For the full SU (3) octet of baryons, the analog of ‘N’ is 


oe T P 
ee a Ba — -= a a A 
B=) MB =| » re. wl (3.2) 
= i m0 — 2A 
kd kd V6 


where the phases have been adjusted to match our quark model phase convention 
of Eq. (XI-1.8). The SU (3) version of Eq. (3.1) becomes 


Ly = Tr (B (iD —m)B — D(By"ys{A,, B}) — F (By"yslA,, B])) 
Z - : 
- a (dn (B{Emé + E'mée*, B}) + fn (B[EméE + E*mé", B])) 
— A Tr (BB) Tr (mU + U'm), (3.3) 


where the covariant derivative is now D,B = 0,B + i[V p, B], & is the SU (3) 
generalization of the quantity in Eq. (3.1) with t replaced by à, m is the diagonal 
SU (3) quark mass matrix, 


m = (ñ, m, Ms 


1 1 
Vas = 3 om + ms)1 + a a Ms)Ag, (3.4) 


and mo is the degenerate baryon mass in the SU (3) chiral limit. Consistency of the 
SU (2) and SU (3) lagrangians requires 
D+ F = ga, dm + fn = 1, 
mo = Mo + Zim, — Zoms( fin = dn). (3.5) 


The description thus far is based on symmetry. It includes quark mass, but not 
higher powers of derivatives. 


Baryon mass splittings and quark masses 


The various parameters (m, ms, Zo etc.) appearing in the chiral lagrangians of 
Eqs. (3.1), (3.3) can be determined from baryon mass and scattering data. In the 
nonstrange sector, the nucleon mass is given in the notation of Eq. (3.1) as 


To isolate the effect of the nonstrange quark mass m and of the constants Zo, Z1, it 
will prove useful to define a quantity o, 
(N|au + dd|N) 


o = my — mo = m— —— = ñ (Zo 4+ 2Z)). (3.7) 
2my 
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Shortly, we shall see how this quantity can be determined from pion—nucleon scat- 
tering data. 

However, let us first consider the baryonic mass splittings generated by the 
mass difference m, — m. Upon using Eq. (3.3) to obtain expressions for the baryon 
masses and working with isospin-averaged masses, it is possible by adopting the 
numerical values 


Zo(ms — m) = 132 MeV, din/ fin = —9.31, (3.8) 
to obtain the following good fit: 
my — my = (fin — dn)Zo(ms — mM) = 251 MeV (expt. : 254.2 MeV), 
4 A 
my — Map = — 34m Zo(ms —m) = 79 MeV (expt. : 77.5 Mev), (3.9) 
Mz — My = 2 fmn Zo(ms — m) = 383 MeV (expt. : 379.2 MeV). 
Observe that these mass splittings depend on Zo but not on Z,. The three relations 
of Eq. (3.9) imply the Gell-Mann—Okubo formula [Ge 61, Ok 62], 
1 3 
my — My = 5 (Ms —my)+ qmx — my) 
(Expt. : 254 MeV = 248 MeV), (3.10) 


which displays an impressive level of agreement (~ 3%) with experimental values. 

The above analysis, based on a chiral lagrangian, can be enhanced by using ideas 
taken from the quark model. In the limit of noninteracting quarks, the quark model 
yields for a general spatial wavefunction,° 


Ma — my = My — My = Mz — my = (m; ~h) f as u =P), (3.11) 


However, observe that my = m, (corresponding in the chiral lagrangian descrip- 
tion to dm = 0) for noninteracting quarks. Of course, the actual A and & baryons 
are not degenerate, so additional physics is required. A quark model source of the 
A — È mass splitting lies in the hyperfine interaction of Eq. (XI—2.14), 


a 1 - 
i = 5 XO Hysi- sj 8), (3.12) 
i<j 


where the prefactor of 1/2 is associated with the color dependence of Eq. (XI-2.4). 
Matrix elements of this operator give rise to the additive mass contributions, 


6 One could equivalently use the language of the potential model, where these baryon mass splittings arise 
from the constituent quark mass difference Ms — M. 


346 Baryon properties 


my =:++— Hon, Maj =: Hons 


(3.13) 


3 3 
8 8 

4l 1 
a 8 


= Fin = sas, mga =" zns T Hss 
where Hij and H;j are related by Hi; = Hi; lY )|? and the subscripts ‘n’, ‘s’ 
denote an interaction involving a nonstrange quark and a strange quark respec- 
tively. For Hnn Æ Hns, the X and A will not be degenerate. Treating both quark 
mass splittings and hyperfine effects as first-order perturbations (e.g. Hss — Hns = 
Hns — Hmn), one obtains quark model mass relations 


my, — my = (m, — ñ) J d’x (u? — 8), 


1 
mzy =mi = z Han = Hans), 


1 
ma — my = z (Hon — Hos) + 2(ms — ñ) J d°x (u? — L) (3.14) 
in accord with the sum rule of Eq. (3.10). These formulae can provide an estimate 
of quark mass. For the usual range of quark model wavefunctions (encompassing 


both bag and potential descriptions), the overlap integral has magnitude 


1 
fas u ne hay a aa (3.15) 


To the extent that this estimate is valid, it produces the values 
ms, — M ~ 230 > 350 MeV, m~11— 14MeV, (3.16) 


where the chiral symmetry mass ratio of Eq. (VII-1.15a) has been used to obtain 
m. In general, quoting absolute values of quark masses is dangerous as one must 
specify how the operator gg, which occurs in the mass term m,qq, has been 
renormalized. It is all too common in the literature to ignore this point by using 
ms — m = m, — my. The values quoted here are actually current-quark mass 
differences, renormalized at a hadronic scale using quark model matrix elements. 
The parameter Zı which appears in the SU (3) lagrangian of Eq. (3.3) is difficult 
to constrain in a quark model. For example, one might consider the matrix element 


(N |msss| N) 


2My 


= Ms (Zi = Zo(fin = dm)) . (3.17) 


The most naive assumption, that (N |m,ss| N) vanishes, would imply Z; = Zo 
(fm — dn) S 1.9 Zo. However, one may legitimately question whether such an 
assumption is reasonable. We shall return to the issue of the ‘strangeness content’ 
of the nucleon later in this section. 
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Goldberger-Treiman relation 


Moving from the study of baryon masses to the topic of interactions, let us consider 
the coupling of pions and nucleons. The SU (2) lagrangian of Eq. (3.1), expanded 
to order 27, becomes 


=- ~ T 
Ly = NGP—my)N + ANyMysSN > dun 
1 - 1 7 
ig V GEXA eg NN (3.18) 
where o is defined in Eq. (3.7). The second term describes the N Nz vertex. Upon 


using Eq. (3.18) to compute the pion emission amplitude N — Nz! and compar- 
ing with the Lorentz invariant form 


My-+nai = —i8znni(p')yst'u(p), (3.19) 
one immediately obtains the Goldberger—Treiman relation [GoT 58], 


SAMN 
F ` 


8xNN = (3.20) 


Inserting the experimental value, g? yy/4m ~ 13.8, for the NN coupling con- 
stant, one finds the Goldberger—Treiman relation to be satisfied to about 2.5%. 

There also exist important implications for the g3 term in the general expression 
given in Eq. (2.9) for the axial-current matrix element. In forming the n — p axial 
matrix element, one encounters a direct y,, ys contribution and also a pion-pole term 
which corresponds to pion propagation from the n —> pr~ emission vertex to the 
axial current. Making use of Eq. (3.20) and Prob. XII-1, we have 


(p(P’)IAzIn(p)) = up’) sar = 


=a 2mng 
=u(p) savor + TENSA guy u(p). (3.21) 


where g = p — p’. It is this induced pseudoscalar modification which allows the 
axial current to be conserved in the chiral limit m2 —> 0, 


q? 


—id"(p(p’)|Ajln(p)) = 2mnga f = | U(p')ysu(p) 


2mn gam _ 
= =e Rees (3.22) 
Note that for nonzero pion mass, the above is consistent with the PCAC relation of 
Eq. (B-3.7), 


Fm, n" = 0" AÑ, (3.23) 
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as both sides have the same matrix element, 


—i(p(p') | Frm22* (0)| n(p)) = iv2 gan nEP yu) x V2 Fam 


T 


2myg m _ ; 
= ap’) ysu(p). (3.24) 
q“ — Mr 


The pion-pole contribution of the axial-vector current-matrix element has been 
probed in nuclear muon capture, as will be described in Sect. XII—4. 


The nucleon sigma term 


One of the features immediately apparent from the effective lagrangian of Eq. (3.1) 
is that all the couplings of pions to nucleons, with the exception of the quark mass 
terms, are derivative couplings. Before turning to the sigma term, which appears 
in the nonderivative sector, let us briefly consider the expansion in powers of the 
number of derivatives for pion-nucleon scattering. Recall for pion—pion scattering 
(cf. Sect. VI-4), there were no large masses and the chiral expansion was expressed 
in terms of m2 or EZ. However, correction terms in the chiral expansion for nucle- 
ons will enter at relatively low energies since a term like 2p -q ~ 2m,E, can 
get large quickly (it is linear in the energy and has a large coefficient, e.g., Ex = 
250 MeV gives 2m,E, = (700 MeV)?). To combat this difficulty, additional (but 
still general) inputs such as analyticity and crossing symmetry are often invoked. 
Fortified with these theoretical constraints, one then matches intermediate-energy 
data to the low-energy chiral parameterizations. The low-energy chiral results 
thereby obtained appear to be well satisfied [H6 83, GaSS 88]. 

The nonderivative pion—nucleon coupling coming from the quark mass terms in 
Eq. (3.1) is of particular interest. To determine this contribution from experiment, 
one must be able to suppress the various derivative couplings. Thus, if one extra- 
polated in the chiral limit to zero four-momentum, the derivative couplings would 
vanish. Not surprisingly then, a soft-pion analysis reveals that the nonderivative 
coupling can be isolated by extrapolating the isospin-even z N scattering amplitude 
with the Born term subtracted (called D* in the literature) to the so-called “Cheng— 
Dashen point? t = m2, s = m4, [ChD 71]. It is conventional to multiply the 
extrapolated amplitude by F2 and thus define a quantity ©, 


X = F? Dep. (3.25) 
To lowest order in the chiral expansion, the measured quantity & is just the matrix 
element o defined in Eq. (3.7), 
(N|au + dd|N) 


L=6 =n-—____—. (3.26) 
2mn 
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It is this isospin-even scattering amplitude D+ which provides a unique window 
on the nonstrange quark mass m. Because © is proportional to the small mass m, 
it is difficult to determine this quantity precisely, and considerable effort has gone 
into its extraction. The Cheng—Dashen point lies outside the physical kinematic 
region, and extrapolation from the experimental region must be done carefully with 
dispersion relations. A recent estimate is [AICO 13] 


E =59+7MeV. (3.27) 


The result o = £ — 15 MeV has been obtained from studies of higher-order chiral 
corrections, implying 


o ~ 44 MeV (3.28) 


as the measure of light-quark mass [GaLS 91]. 


Strangeness in the nucleon 


In light of the above discussion, it is tempting to interpret various contributions to 
the nucleon mass by making use of the energy-momentum trace. Recall the trace 
anomaly of Eq. (III-4.16), 


Bocp 


0” = 
7 283 


ki F’ 4m uu + madd + mss. (3.29) 


Taking the nucleon matrix element gives 


(N [opi] N) 

my = —— =m +9, 

2my 
mo = (2my) (N foc F2, F*"” + m,ss| N) ~ 894 +8MeV, 

83 
a 
pag OOA cae (3.30) 
2myN 


This result is already quite interesting in that the largest contributions, the gluon 
and strange-quark terms in mo, appear to be ‘nonvalence’. At this stage, the sepa- 
ration is essentially model-independent. 

One can explore the ‘strangeness content of the nucleon’ by using an SU (3) 
analysis of hyperon masses. Thus, we introduce a mass-splitting operator, which 
transforms as the eighth component of an octet, 


1 = 
a 3m —m,)(uu + dd — 25s). (3.31) 
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Since the hyperon mass splittings are governed by this octet operator, we find 


s —m)(au + dd — 25 3 

j ON aes EN te) = Se, A) 
2mp 2 

When scaled by the quark mass ratio m/m,, Eq. (3.32a) becomes 

, (N lau + dd — 255| N) 

m 


2mn 


ô= 


3 2 
= =" (ms — my) ~ 25MeV (35MeV), (3.32b) 
2m% — m2 
where the figure in parentheses includes higher-order chiral corrections [Ga 87]. 
Comparison of ô and ø immediately indicates that they are compatible only if the 


strange-quark matrix element does not vanish. Indeed, one requires 
(N |ss| N) 

(N |ūu + dd + 55| N) 

This gives for the constant Z, of Eq. (3.1) the value Z; ~ 3.9Zọ (2.9 Zo) to be 


contrasted with the estimate which follows Eq. (3.17). At the same time, one can 
separate out the following matrix elements 


~ 0.18 (0.09). (3.33) 


Bocp 
283 


(2my)~'(N |ms5s| N) ~ 260MeV (130MeV), (3.34) 


(2my)'(N 


N) ~ 634MeV (764MeV), 


a papv 
ie 


where figures in brackets use the corresponding bracketed quantity in Eq. (3.32b). 
Note the surprisingly large effect of the strange quarks. These results are contro- 
versial because they draw a counter-intuitive conclusion from the use of SU (3) 
symmetry. However, even with SU (3) breaking, the difference between o and the 
SU (3) value of ô is large enough that some ss contribution is likely to be required. 

This analysis does not go well with the naive interpretation of the quark model 
as embodied, for example, in the proton-state vector formula which began this 
chapter. However, it is possibly compatible with a more sophisticated interpretation 
of the constituent quarks which enter into quark models. In the process of forming 
a constituent quark, the quark is ‘dressed’ by gluonic and even ss quark fields. It 
is no longer the naive object that occurs in the QCD lagrangian. It is this dressed 
object which may then easily generate gluonic and perhaps strange quark matrix 
elements. Recall that even the vacuum state has gluonic and quark matrix elements. 
Similar explanations exist in bag and Skyrme models [DON 86]. This issue remains 
unresolved at present. 

Based on the possible existence of a substantial nonzero value for the scalar 
density matrix element (N|ss|N), a major program was launched to investigate 
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the possibility for a similar nonzero value for the strange vector-current matrix 
element (N|sSy,s|N) which can be characterized in terms of charge and magnetic 
form factors F;(q7), F; (q?) via 


(NIšyusIN) = ū(p’) Pace - wa 0) u(p). (3.35) 
The form factor FY (q?) obeys F i (0) = 0, whereas F} (q?) has no such constraint.’ 
In order to determine the size of the uy,u, d Yud, SY,S contributions to the cor- 
responding nucleon matrix elements, three experimental inputs are required. Two 
of these come from well-known electromagnetic form factors of the proton and 
neutron. The third can be found by performing parity-violating electron-scattering 
experiments from the proton, by measuring the difference in the cross sections for 
the scattering of electrons with left- and right-handed helicities. This is sensitive to 
the strange-quark current because the electromagnetic current and the neutral weak 
current involve strange quarks with different strengths. In this case there exists an 
interference between the electromagnetic (y-exchange) and weak (Z°-exchange) 
contributions and the resultant asymmetry will have the form 


Z dor = do, Gq 
~ dort+do; 4ra 


where Mg, Mm involve the interference of the electromagnetic and electric, mag- 
netic weak form factors and the ellipses indicate a small piece involving the vec- 
tor electron coupling and the axial current. In this asymmetry, the electron side 
involves an axial current while the nucleon side involves a vector current. This 
asymmetry has been studied as a function of q? in a series of experiments at elec- 
tron laboratories at MIT-Bates, at Jefferson Laboratory, and at the Mainz microtron. 
The result is that no signal for a strange vector-current matrix element has been 
seen and limits have been placed on the strange form factors. Numerically, strange 
quarks contribute less than 5% of the mean square charge radius and less than 10% 
of the magnetic moment of the proton. Reviews of this body of work can be found 
in [ArM 12] and [BeH 01]. 


ALR (Me + Mu+:-), (3.36) 


Quarks and nucleon spin structure 


The constituent quark model provides a simple picture of the contents of baryons as 
systems composed of three constituent quarks and nothing else. A rigorous descrip- 
tion using the quark and gluon degrees of freedom which appear in the fandamental 


7 The condition on F i (0) is a consequence of current conservation. Equivalently, taking u = 0 in Eq. (3.35) 


and integrating over the proton volume, one encounters the strangeness ‘charge’ S = j. d?x st (x)s(x) and 
S| N) = 0 since the nucleon carries no net strangeness. 
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lagrangian is in general more complex, but it is often nevertheless instructive to 
explore the constituent picture of a given observable. An interesting example is the 
spin structure of the nucleon. 

For any Lorentz invariant theory, Noether’s theorem requires that there exist 
an angular momentum tensor M“°? which is conserved (0,M uaß — ()) and which 
gives rise to three angular momentum charges associated with rotational invariance, 


J® = J d?x M°? (x), (3.37) 


In the rest frame of a particle, the {J} are related to the three components of 
angular momenta via 


a eo ee 
Ji = P, (3.38) 
For the example of a free fermion, the above quantities take the form 
= 1 - 
MH? = iy" (x*9? — xP 9%) wt seta, (3.39) 


up to total derivatives which do not contribute to the charges, and 


J= fas [-iv' (x x dwt bre | =L+S. (3.40) 


The two contributions in Eq. (3.40) may be labeled the orbital and spin components 
of the angular momentum. 

The quarks in the Noether current are lagrangian (current) quarks, not con- 
stituent quarks. Nevertheless, in the spirit of the quark model let us apply Eq. (3.40) 
to the quarks in a spin-up proton. As expressed in terms of upper (u) and lower (£) 
components (cf. Eq. (XI-1.13)), the orbital and spin contributions are found to be 


_2 3 2 = a 2 1), (0) 
L =F f xeo), S) = fas (« 0) 7 (3.41) 


Aside from the factor 1/2 occurring in ø /2, the quark spin contribution to S is just 
the axial-vector matrix element of Eq. (1.24), whereas the orbital angular momen- 
tum contains just the lower component £ because the x x ð operator has a nonzero 
effect only when acting on the ø -$ factor in the lower component of Eq. (XI-1.13). 
Observe that the orbital angular momentum is nonvanishing and proportional to the 
quark spin. The spin and orbital portions for the individual u, d flavors are easily 
computed to yield 


2 1 1 1 
(S@) = = [a's G — 5e) , (s8) = -z [a's (« — 5e) 
3 : 3 (3.42) 


(LM) = 5 fas l, (LM) = -5 f ax e. 
Z 9 & 9 
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A first lesson is that, despite the spin wavefunction of the protons being written 
entirely in terms of quarks as in Table XI-2, the quark spin averages of Eq. (3.42) 
do not add up to yield the proton spin. The sum is reduced from the anticipated 
value of 1/2 by the lower component £ in the Dirac spinor. It is the total angular 
momentum J which has the expected result, 


(J) = (0), (3.43) 


but the total is split up between the orbital and spin components. The bag model, 
for example, yields 


(S) > 0.65 (J), (3.44) 


so about 35% of the nucleon spin arises from orbital angular momentum. 

Of course, QCD is a full interacting theory and the discussion of the angular 
momenta of the quarks and the gluons cannot be fully separated because these 
fields interact with each other. The total angular momentum can be decomposed 
into several terms, including the interactions between the fields [JaM 90]. These 
can be grouped in various ways. In the current quark—gluon description, it is com- 
mon to write 

i= E, ; (3.45) 
where S,, Lq are the spin and angular momentum components carried by the 
quarks and J, is that carried by the gluons. Thus, we have 


3 2 : 
n= fax |w Stu! xx ciy], 
J; = fas x x (E x B), (3.46) 


where È is the usual Dirac spin matrix and D,Y = [0, +igA,]¥ is the covariant 
derivative of y and therefore, in this definition, the quark angular momentum has 
a gluonic component [JiTH 96]. 

Polarized deep-inelastic electron scattering from the nucleon can measure spin 
effects of the quarks. The study of spin dependent deep inelastic scattering involves 
the antisymmetric component of the nucleon tensor,® which can be written in the 
form 


1 - 
wI = = I deep Gd, Ole) 
sÊ myvs® — s - qp? 
= -ienasi (010, O56. |: (3.47) 
My My 


8 More details can be found in the review [Ba 05]. 
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where v = p-q/my and Q? = —q?. The scaling behavior of the two structure 
functions is 


2 
git, 0) =—Giv, 0), gaos (>) Ga, O G48) 
My My 


where x = Q?/2myv is the Bjorken scaling variable. In the parton model, neglect- 
ing QCD renormalization, one determines 
1 1 
_ 2 g®) (8) (0) 
[ ago Q’) = 5 eda = si O+ se8O +580 (3.49) 
where ge ‘a i gl X ) are the isovector, SU (3) octet, and flavor-singlet axial charges 
iespectively. The axial charges are written in terms of their quark spin content as 


2mnspAgq = (P, S| YuYs4lp, $), (3.50) 


with 
1 
Aq = J dx (qx(x) — q, x)), (3.51) 
0 


where q;(x) is the parton distribution function carrying spin s. In terms of the light 
quarks we have then 


) Au— Ad, g® =Au+Ad—2As, g® =Au+Ad+ As. (3.52) 
The first two of these are well defined from the study of hyperon beta decay, 


go = F + D = 1.27 + 0.003 (from neutron beta decay) , 
” =3F —-D=0.58 + 0.03 (from semileptonic hyperon decay). (3.53) 


The first of these is directly measured and the second comes from an SU (3) rotation 
from the values that are obtained in an SU (3) fit to AS = 1 hyperon decay. Such 
a partonic analysis leads to a decomposition, Au = 0.84 + 0.01 + 0.02, Ad = 
—0.43 + 0.01 + 0.02 and As = —0.08 + 0.01 + 0.02, where these numbers are 
from recent COMPASS data [Qu 12]. The sum of these, Au + Ad + As ~ 0.33, 
is about half of what would be expected for the nucleon spin in the naive quark 
model, Eq. (3.44), and of course the quark model predicts that the As should be 
Zero. 

However, there is reason for caution in this interpretation. The singlet axial 
current, 


J = üy, ysu + dy,ysd + SYpVss, (3.54) 


whose matrix element is said to be represented by Au + Ad + As, is anomalous, 
as seen in Sect. II-3. This has important consequences [AIR 88, Sh 08]. While 
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the axial currents which transform as SU (3) octets have only finite multiplica- 
tive renormalization, the singlet current mixes with gluonic fields under radia- 
tive corrections. Different renormalization schemes yield different mixtures of the 
quark and gluon components [Sh 08]. Moreover, the quark component is not scale- 
independent; there is renormalization group running as a function of Q?. Note that 
the other currents do not suffer from these problems. In particular, the Bjorken 
sum rule [Bj 66] involves the difference of the proton and neutron matrix elements, 
which then cancels out the isosinglet contributions, such that the first moment is 
independent of Q?, 


1 

= 1 

| dx g?" (x, Q) = 28 (3.55) 
0 


This sum rule yields a value go = 1.28 + 0.07 £0.01, which agrees well with the 
number 2 = 1.270 + 0.003 measured in neutron beta decay. The anomaly in the 
singlet current complicates the discussion of the quark contribution to the proton 
spin. 

The partonic analysis of the quark spins has led to further studies. Attempts at 
the experimental study of the gluonic contributions has revealed only a small con- 
tribution to the nucleon spin [AiBHM 13]. There may be the possibility of studying 
the angular-momentum components through the concept of generalized parton dis- 
tributions [Ji 94]. However, the experimental determination of these generalized 
parton distributions is yet to be achieved. 


XII-4 Nuclear weak processes 


One area in which the structure of the weak hadronic current has received a great 
deal of attention is that of nuclear beta decay and muon capture. Although in 
some sense this represents simply a nuclear modification of the basic weak tran- 
sitions n > p+e +e, p > n+ et + ve, the use of nuclei allows spe- 
cific features to be accented by the choice of levels possessing particular spins 
and/or parities [Ho 89]. Here, we shall confine our attention to allowed decays 
(AJ = 0, 1, no parity change) and will emphasize those aspects which stress 
the structure of the weak current rather than that of the nucleus itself. In particular, 
nuclear beta decay provides the best determination of Vaa, while muon capture pro- 
vides the only measurement of the pseudoscalar axial weak form factor predicted 
by chiral symmetry. 


Measurement of Vua 


There are many occurrences in nuclei of an isotriplet of J’ = O% states. Exam- 
ples are found with A = 10, 14, 26, 34, 42, .... Because Coulombic effects raise 
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the mass of the proton-rich /;=1 state with respect to that with /;=0, the positron 
emission process N, (J, = 1) > No, = 0)+e* +», can occur. These transitions 
are particularly clean theoretically, and this is the reason why they are important. 
Since the transition is 0 — O*, only the vector current is involved, and because 
of the lack of spin there can be no weak magnetic form factor. The vector-current 
matrix element involves but a single form factor a(q7), 


(No(p2)|VulNi(p1)) = a(q?)(pi + Pr) (4.1) 


This form factor is known at q° = 0 because the charged vector weak current V, 
is just the isospin rotation of the electromagnetic current, 


U, J£] = dy"u. (4.2) 


This relation is often called the conserved vector current hypothesis or CVC, and 
requires for each of the Ot —> 0O* transitions, 


a(0) = V2. (4.3) 


What is generally quoted for such decays is the Ft, /2 value, essentially the half- 
life tı;2 multiplied by the (kinematic) phase space factor f plus various radiative 
and Coulomb corrections [WiM 72]. Theoretically, one expects a universal form 


273 In2 q 
TMS mV (1- 6MM), A) 
which should be identical for each isotriplet transition. G,, is the weak decay con- 
stant measured in muon decay while the logarithmic correction arises from ‘hard’ - 
photon corrections, as discussed in Chap. VII. The ‘soft’-photon piece as well as 
finite-size and Coulombic corrections are contained in the phase space factor F. 
Much careful experimental and theoretical study has been given to this problem, 
and the current situation is summarized in Table XII-3 where the experimental 
Ftıj2 values are tabulated. A fit to these and additional Fermi decays produces 
the value Ftiı;2 = 3072.08 + 0.79 s with chi-squared per degree of freedom 
x7/v = 0.28. This excellent agreement over a wide range of Z values is evidence 
that soft-photon corrections are under control. 

Comparison of the experimental Ftı;2 value with the theoretical expression 
given in Eq. (4.4) yields the determination 


Vaa = 0.97425(22), (4.5) 


which makes Vaa the most precisely measured component of the CKM matrix. 
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Table XII-3. Energy release and Ftp values for 
O+ — 0+ Fermi decays [HaT 09]. 


Nucleus Eo(KeV) Fty/2 (s) 

ug 885.87(11) 3076.7(4.6) 
140 1809.24(23) 3071.5(3.3) 
26m A] 3210.66(06) 3072.4(1.4) 
34C] 4469.64(23) 3070.2(2.1) 
38mK 5022.40(11) 3072.5(2.4) 
42S¢ 5404.28(30) 3072.4(2.7) 
46y 6030.49(16) 3073.3(2.7) 
50Mn 6612.45(07) 3070.9(2.8) 
54Co 7222.37(28) 3069.9(3.2) 


The pseudoscalar axial form factor 


Chiral symmetry predicts a rather striking result for the form factor g3(q°) of 
Eq. (2.9), namely that it is determined by the pion pole with a coupling fixed by 
the PCAC condition. One cannot detect this term in either neutron or nuclear beta 
decay because when the full matrix element is taken, one obtains 


83Me_ 


8 sg TPC — ys)u (Pe), (4.6) 
MN 


7 > T(p)qu y" (1 + ys)u(pe) = 
MN 


which is proportional to the electron mass and is thus too small to be seen (effects 
in the spectra are O(m2/myE.) « 1). However, in the muon capture process 
WT p — v,n, the corresponding effect is O(m,,/my) ~ 10%. Thus, muon capture 
is a feasible arena in which to study the chiral symmetry prediction [CzM 07]. The 
drawback in this case is that typically one has available from experiment only a 
single number, the capture rate. In order to interpret such experiments, one needs 
to know the value of each nuclear form factor at q? ~ —0.9 m?, which intro- 
duces some uncertainty since these quantities are determined in beta decay only 
at q? ~ 0. Nevertheless, predicted and experimental capture rates are generally in 
good agreement provided one assumes (i) the q? ~ 0 value of form factors from 
the analogous beta decay, (ii) q? dependence of form factors from CVC and elec- 
tron scattering results, (iii) the CVC value for the weak magnetic term fz, and (iv) 
the PCAC value of Eq. (3.21) for g3. The results are summarized in Table XII-4. 
Obviously, agreement is good except for °Li, for which the origin of the discrep- 
ancy is unknown, although it has been speculated that perhaps the spin mixture is 
not statistical. Also, in the case of *He there remains a small disagreement between 
the elementary particle model (EPM) and impulse approximation (IA) predictions 
for the capture rate. 
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Table XII-4. Muon capture rates. 


Reaction Theory (103 s7!) Experiment (103 s7!) 
Bo + povyta 0.712 + 0.005¢ 0.715 + 0.005 + 0.0054 
uT + 7He > vy + °H 1.537 -0.022™ 1.496 + 0.004 
1.506 + 0.0154 

m ! +0.33 
uT + Li > v, + He 0.98 + 0.15 1.60" 9 19 

= 2 12 4 +03 
uT + PC> v, + 2B 7.01 +0.16 6.75 "9.75 
aS ip À; 


Before proceeding, we should emphasize one relevant point. When PCAC is 
applied, it is for the nucleon 


2 D= 
2mygi(q?) — 93(q°) = 2Fr8ann (4°) (1-4) ' (4.7) 


2mNn 
Then, at q? = 0, we have 


gann (MŽ) = 


Fr 
1.27 = g1(0) > 1.30, (4.8) 


My 


which is the Goldberger-Treiman relation. On the other hand, taking similar q? 
dependence for gı (q?) and gynn (q°), we find 


My g3(—0.9m2 ) 2mymy, 1 : 
~ - ~ 6.45. 4.9 
2my g1(—0.9m?,) m? + 0.9m? z aMMy (4.9) 


PCAC is generally applied in nuclei in the context of a simple impulse approxi- 
mation, and it is this version of PCAC which is tested by the muon capture rates 
listed in Table XII—4. The direct application of PCAC in nuclei cannot generally 
be utilized since the pion couplings are unknown. 

In the case of muon capture on !?C, additional experimental data are available. 
One class of experiment involves measurement of the polarization of the recoiling 
12B nucleus. Combining this measurement with that of the total capture rate yields 
a separate test of CVC as well as of PCAC. The results, 


expt 


—0.9m2 
ins, 22a 
ty 2mn 81 (—0.9m7) 


=8.0 3.0, (4.10) 


are in good agreement with both symmetry assumptions. 
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In addition, one can measure the average and longitudinal recoil polarizations in 
the !?C muon capture, yielding a value for the induced pseudoscalar coupling, 

My 83 (—0.9m? ) 

2my 81 (—0.9m7) 


= 9.0+1.7, (4.11) 


which is again in good agreement with PCAC. 
The most precise value comes from the recent measurement of the singlet-muon 
capture rate in hydrogen, which yields 


m, 83(—0.9m7,) 
2mn 81 (—0.9m7,) 


= 5.75 + 0.95, (4.12) 


which is excellent agreement with PCAC. 


XII-5 Hyperon semileptonic decay 


The goals in studying hyperon semileptonic processes are to confirm the value of 
Vas Obtained in kaon decay and to use the form factors to better understand hadronic 
structure. These two goals are interconnected. In earlier days when data were not 
very precise, fits to hyperon decays were made under the assumption of perfect 
SU (3) invariance in order to extract Vąs. Presently, the experiments are precise 
enough that exact SU (3) no longer provides an acceptable fit. The desire to learn 
about V,, is thus impacted by the need to understand the SU (3) breaking. 

We have already described in Sect. XII-1 the physics ingredients which lead to 
SU (3) breaking within a simple quark description. These include recoil or center- 
of-mass corrections, wavefunction mismatch (in which a normalization condition 
realized in the symmetry limit no longer holds), and generation of the axial form 
factor go. For hyperons, because of the presence of the axial current, SU (3) break- 
ing can occur in first order. This means that hyperon decays are more difficult to 
use for determining Vas than are kaon decays, where the Ademollo—Gatto theo- 
rem reduces the amount of symmetry breaking. Thus, at the moment it is probably 
best to use the value of V,, determined from kaon decay, and require that hyperon 
decays yield a consistent value. 

The clearest evidence on SU (3) breaking comes from the X= —> A +e7 + 
Ve rate. Since this is a AS = 0 process, Vas does not enter and, in addition, the 
vector current matrix element must vanish. Thus, the rate is determined by the 
axial-current contribution alone, for which the theoretical prediction is 


7 2 
ae np (5.1) 
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where p is a SU(3) breaking factor due to the center-of-mass effect. A bag-model 
estimate yields p = 0.939. Taking p = 1, the best SU (3) symmetric fit to all the 
data [CaSW 03] would require D/(D + F) = 0.635 + 0.006, and hence g? ^ = 
0.658 if SU(3) were exact. On the other hand, the data on &~ — Aev, requires 
g> ^ = 0.591 + 0.014, which implies the correction pọ = 0.931 + 0.022. There 
seems to be no way to avoid this need for SU (3) breaking. 

The full pattern of SU (3) breaking is more difficult to uncover. One problem is 
experimental. When the g; values are extracted from the data, they have generally 
been analyzed under the assumptions that the fı and f2 form factors have exactly 
their SU (3) values and that g) = 0. If these assumptions are not correct, then the 
values cited in [RPP 12] do not reflect the true gı but rather some combination 
of g1, fi, f2, and g2. The correlation with g% is particularly strong. Thus, quoted 
values of g, must be treated with caution. 

The present status of these decays is reviewed in [CaS W 03]. The data can be fit 
well either by the center-of-mass correction described above, with g2 = 0, or by 
the full corrections including wavefunction mismatch, with g2/g, = 0.20 + 0.07 
in A —> p +e + ve. Without an independent measurement of g2 one cannot decide 
between these. We note, however, that either option yields a value of Vas consistent 
with that found in kaon decays, 


Vas = 0.2250 + 0.0027. (5.2) 


XII-6 Nonleptonic decay 


The dominant decays of hyperons are the nonleptonic B —> B’m modes. Because 
of the spin of the baryons and the many decay modes available, the nonleptonic 
hyperon decays present a richer opportunity for study than do the nonleptonic kaon 
decays. 


Phenomenology 
The B — B'r matrix elements can be written in the form 
MB>B'z = u(p’) [A + Bys] u(p), (6.1) 


with parity-violating (A) and parity-conserving (B) amplitudes. Watson’s theorem 
implies that if CP is conserved, the phase of these amplitudes is given by the strong 
B'r scattering phase shifts in the final-state S wave (for A) or P wave (for B), i.e., 


A = Ao exp (i63,,.), B = Bo exp (idp,,), (6.2) 
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with Ao, Bo real. Aside from the xN system, these phase shifts are not known 
precisely, but are estimated to be ~ 10° in magnitude. The decay rate is expressed 
in terms of the partial wave amplitudes by 
Iq|(E" + mp’) = 
Ppa = —7———_ (IAP + 1B’), (6.3) 
4am pz 
where q is the pion momentum in the parent rest frame and we define B = (E’ — 
mp: /E' + mg’)'/*B. Additional observables are the decay distribution W (0), 


W(0)=1+aP,-p eee) (6.4) 
= a . A œa = —, 5 
as AP + IBP 
and the polarization (Pp) of the final-state baryon, 
Pp) = (a+ Ps - ps) bs + B (Ps x pa) +y [Pe x (Pa x Be’) | 
2Im(A* B A|? — |B}? 
je a a a 
|A? + |B}? |A|? + |B}? 


where P} is the polarization of B and pz is a unit vector in the direction of motion 
of B’. Experimental studies of these distributions lead to the amplitudes listed in 
Table XII-5. 

The nonleptonic amplitudes may be decomposed into isospin components in a 
notation where superscripts refer to AJ = 1/2, 3/2, 


Aj->pr- = V2 AW = AY’, Ay-snn- = AS? + Ay’ 
— a) (3) 1 2 
Aj-snx® = ZAR E V2 AS > Ast ionnt = 34D = L A9 + Xr, (6.6) 
= a) (3) ` 
Ax0_,an09 = —Ag — V2AS ? JA S 240 442) X 
Assan- =V2A® — A® PaT ee TA 


and X» is of mixed symmetry. Similar relations hold for the B amplitudes. From 
the entries in Table XII—5 it is not hard to see that the AJ = 1/2 rule, described 
previously for kaon decays, is also present here. Table XII-6 illustrates that the 
dominance of AJ = 1/2 amplitudes compared to those with AJ = 3/2 holds 
in the six possible tests in S-wave and P-wave hyperon decay, at about the same 
level (several per cent) as occurs in kaon decay.’ Thus, the AJ = 1/2 rule is 
not an accident of kaon physics, but is rather a universal feature of nonleptonic 
decays. This makes the failure to clearly understand it all the more frustrating. 


9 For P waves, the observed smallness of By —_,n7— indicates that BY is small, presumably accidentally so. 


In this case the measure of AJ = 3/2 to AJ = 1/2 effects is given by BO) /Xy. 


362 Baryon properties 


Table XII-5. Hyperon decay amplitudes.“ 


A amplitudes B amplitudes 
Mode Expt. Thy.? Expt. Thy. 
A > pr” 3.25 3.38 22.1 23.0 
A > nn? —2.37 —2.39 —-15.8 —16.0 
xt > nat 0.13 0.00 42.2 4.3 
Et > pr® —3.27 —3.18 26.6 10.0 
D> > nn 4.27 4.50 —1.44 —10.0 
5° > An? 3.43 3.14 —12.3 3.3 
a7 > An —4.51 —4.45 16.6 —4.7 


“Tn units of 1077. 
>Lowest-order chiral fit. 


The assumption that the dominant AJ = 1/2 hamiltonian is a member of an SU (3) 
octet leads to an additional formula, called the Lee-Sugawara relation, 


V3 Az+ pro = 2Ag-+an- + Aa pr- (6.7) 


which also is well satisfied by the data. In this case, the corresponding formula for 
the B amplitudes is not a symmetry prediction [MaRR 69], although for unknown 
reasons it is in qualitative accord there also. 


Lowest-order chiral analysis 


Chiral symmetry provides a description of hyperon nonleptonic decay, which is 
of mixed success when truncated at lowest order in the energy expansion. Given 
our comments on the convergence of the energy expansion for baryons made in 
Sect. XII-3, the need for corrections to the lowest-order results is not surprising. 
We shall present the lowest-order analysis here, as it forms the starting point for 
most theoretical analyses. 

Recalling from Sect. IV-7 the procedure for adding baryons to the chiral anal- 
ysis, one finds that the two following nonderivative lagrangians have the chiral 
(81, 1r) transformation property: 


Table XII-6. Ratio of AI = 3/2, 1/2 amplitudes. 


S wave P wave 
A 0.014 0.006 
x —0.017 —0.047 
E 0.034 0.023 
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Fig. XII-1 P-wave hyperon decay amplitudes. 


Lip = DTr (B{£t6£, B}) + F Tr (B [£"A6£, B]), 
LY = Ds Tr (Bys{E Ast, B)) + Fs Tr (Bys [EtA6£, B]), (6.8) 


where £, B are defined in Eqs. (3.1), (3.2) respectively. However, the operator i 
must vanish, as it has the wrong transformation property under CP [LeS 64]. That 
is, a CP transformation implies 


B > (iņB), H> (Y, (6.9) 


and including the anticommutation of B and B, L is seen to return to itself, 
but Le changes sign and hence must vanish. This leaves T as the only allowed 
chiral lagrangian at lowest order. Observe that pS lacks a ys factor. Thus, its B —> 
B'r matrix elements will be parity-violating, leading to only A amplitudes. The 
parity-conserving B amplitudes in B —> B’z are produced through pole diagrams 
as in Fig. XII-1, and are proportional to the parity-conserving B — B’ matrix 
elements of Lo, 

The counting of powers of energy (momentum transfer) in the energy expansion 
goes as follows. Both the B —> B’ transition and the A amplitudes in B —> B'r are 
obtained as matrix elements of Lo , which is zeroth order in the energy. The pole 
diagrams are likewise of zeroth order in the energy, being the product of the LO 
vertex (O(1)), a baryon propagator (© (q7!)) and an NN7 vertex (O(q)). Since 
the kinematic part of the pole diagrams, w ysu ~ ø - q, is of first order in q, the B 
amplitudes themselves are of order B ~ q7! ~ 1/Am for the baryon pole. Kaon 
poles and higher-order chiral lagrangians enter at next order, i.e., having one power 
of the momentum transfer. 

The lowest-order chiral SU (3) analysis provides a fit to the data in terms of two 
parameters, called F and D, 


V6 


1 iA = 
IA nnr = —~—(F + D), Dt pr’ 
LÀ Anr? oF, | + D) OF 


(D — F), 
H (6.10) 
[Ay+ sant = 0, [Ag = _—(3F-—D 
L Bo, An? 2 F, ( ), 
with other amplitudes being predicted by the AJ = 1/2 rule. Use of the numerical 
values 
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— = —0.42, 
F 2F, 


= 0.92 x 1077, (6.11) 


leads to the excellent fit of the S-wave amplitudes seen in Table XII-5. Note that 
this form has one less free parameter than the general SU (3) structure [MaRR 69]. 
Thus, the prediction of chiral symmetry that Ay+-.,,+ œ% 0 is independent of the 
D/F ratio (up to AI = 3/2 effects), and represents a successful explanation of the 
smallness of this amplitude. 

In principle, the A amplitudes, together with the strong B B’z vertices, determine 
the baryon pole contributions to the B amplitudes. These are then parametrized by 
the same d/f ratio as in the axial current,!° e.g., 


2f PrE 
Mat Etr = evn su gq! = 4 ity, ysu q” 
28nnn f - M 
= —— u u : 6.12 
Ding Fed YuYsu q (6.12) 


Using this parameterization for the pole diagrams, one finds contributions such as 


my +m» (d— f)Ms+p 


2my Fr my — My 


B+ pr = (6.13) 
Taking d + f = 1.27, d/f = 1.8, one obtains from relations like this the disap- 
pointing P-wave predictions quoted in Table XII-5. This failure to simultaneously 
fit the S waves and P waves is a deficiency of the lowest-order chiral analysis. Per- 
haps this is not too surprising, as the chiral expansion converges slower in baryons 
than in mesons [BoH 99]. At the next order in the energy expansion, the chiral 
analysis contains enough free parameters to accommodate the data, but is not pre- 
dictive. Lattice studies are just beginning to explore this topic [BeBPS 05]. 


Problems 


(1) The axial-vector coupling 


Consider the effective lagrangian in Eq. (3.1) for nucleons and pions. For com- 
bined left-handed and right-handed transformations of the fields, we have 


U > LUR, €> LEV'=VER', NOVN, 


where L[R] are the spacetime independent SU (2) matrices corresponding to 
global transformations in SU(2);[SU(2)r] and V = V(a(x)) is an SU (2) 


10 This statement is the SU (3) generalization of the Goldberger—Treiman relation, Eqs. (3.20), (3.24). 


(2) 
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matrix describing a vectorial transformation of the nucleons. For the lagrangian 
of Eq. (3.1), use Noether’s theorem to generate the SU (2) axial-vector current, 


Aj, = E Pyuys (EX8? + EAE) Y, 


where & is the ‘square root’ of U (cf. Eq. (3.1)), and thereby show that the 
axial-vector coupling constant for beta decay is given by g; = ga. 

CP violation and nonleptonic hyperon decay 

Although the AS = 1 hamiltonian of the Standard Model contains a CP- 
violating component, there is no practical way to see this in any single hyperon 
decay mode. Rather, one must compare the decays of hyperons with those of 
antihyperons [DoHP 86]. In the presence of CP violation, there are two sources 
of phases in the weak matrix elements, e.g., for the A decay modes, 


„aS oes nS sS 

ô 5 

Aj pr- = Aj el ei + A3 e” e’, 
>. P +P i P “aie 

5 5 
Ba+pr- = B e”! ei + Bz e' eS, 


where the isospin (Z) subscripts ‘1, 3’ stand for AJ = 1/2, 3/2, the angular 
momentum (J) superscripts ‘S, P’ stand for S waves or P waves, A; are real 
amplitudes, 57 are strong final-state phases, and g/ are the weak CP-violating 
phases. Observe that there are three small parameters in these amplitudes — the 
weak phases 7, the strong phases ôf ~ 10°, and the ratio of AJ = 3/2 to 
AI = 1/2 effects. To leading order in these quantities, show that one has the 
CP-odd observables, 


ftf = sin (pf — 97), a = — sin (pf — gf) sin (8? — ôr), 
Lpr- — Waa _ Aids sin(ôf — 83) sin(g? — 93) 
Ppr- + p+ |A|? + |Bi/? 
B: B3 sin(6? — 6?) sing? — oF) 
|41]? + |811? l 


A hierarchy is apparent in these three signals. The 6 + 6 asymmetry requires 
only the weak phase, the w + œ asymmetry requires both the weak and final- 
state phases, while r — T has both phases plus a AJ = 3/2 suppression. 
Present experiments are not sufficiently sensitive to test for CP violation in 
these observables at the required accuracy. 


XI 


Hadron spectroscopy 


Studies of hadron masses, and of both strong and electromagnetic decays of 
hadrons, provide insights regarding QCD dynamics over a variety of distance scales. 
Among various possible theoretical approaches, the potential model has most heay- 
ily been employed in this area. We shall start our discussion by considering heavy- 
quark bound states, which begin to approximate truly nonrelativistic systems and 
for which the potential model is expected to provide a suitable basis for discussion. 


XIII-1 The charmonium and bottomonium systems 


Quarkonium is the bound state of a heavy quark Q with its antiparticle. Two 
such systems, charmonium (cĉ) and bottomonium (bb) have been the subject of 
much experimental and theoretical study; a comprehensive overview is provided by 
[Br et al. 11]. Due to weak decay of the top quark, the tt system has rather different 
properties from these, and thus constitutes a special case (cf. Sect. XIV—2). 

Since the quarkonium systems are quark—antiquark composites, we shall employ 
the set of quantum numbers n, L, S, J introduced in Sect. XI-2. One generally 
refers to the individual quarkonium levels with the nomenclature of Table XIII-1, 


Table XIII-1. Nomenclature for S-wave and P-wave states 
in the cc and bb systems. 


L 5 Charmonium Bottomonium 
1 yns)’ Y(nS) 
0 Nc(nS) n (ns) 

il 1 Xe (nP) Xo (nP) 
0 he(nP) hp(nP) 


“For historical reasons, the spin-one charmonium ground state is 
called J/w. 
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Fig. XII-1 The low-lying spectrum of charmonium. 


although the n L identification is sometimes replaced by either the degree of excita- 
tion or the mass, e.g., Y (2S) is called Y’ or (3686). The n25+!L, spectroscopic 
notation is also invoked on occasion. 

Figs. XIII-1,2 give a summary of the lightest observed cĉ and bb states. Most of 
these states, as well as their transitions have been detected both in the charmonium 
and bottomonium systems [Br ef al. 11]. The largest set of observed excitations 
comes from the y (nS) and Y(nS) radial towers, reaching up to n = 6 for the T 
system. Excitation energies are relatively small on the scale of the bottomonium 
reduced mass up ~ 2.5 GeV, but not that of charmonium ue ~ 0.8 GeV. 

Phenomenological potentials: Historically, the success of potential models in 
charmonium was of major importance in convincing the community that quarks 
were simple dynamical objects and that QCD provides a manageable theory of the 
strong interactions. Because of this success, we describe the states by the spec- 
troscopic classification of nonrelativistic quantum mechanics. Thus, quarkonium 
mass values are often expressed as 


Mntsy) = 2Mo + Enis, (1.1) 


10.5 


Mass (GeV/c 2) 


9.5 


J PO. ot 177 qo ott q+ ott 


Fig. XI-2 The low-lying spectrum of bottomonium. 


368 Hadron spectroscopy 


oo 
N WAB 


een S 1 
Oscillator Quarkonium Coulomb 


Fig. XIN-3 Energy levels of various potential functions. 


where Ejnzrsy; is obtained by solving the Schrodinger equation for a particle of 
reduced mass 4g = Mg/2 moving in the field of an assumed potential energy 
function. The shape of the potential is chosen via a combination of theoretical and 
phenomenological considerations. 

The spectra of quarkonium states already hints at the radial dependence of 
the QQ potential, with the progression in nL levels suggesting an interaction 
which lies ‘between’ Coulomb and harmonic oscillator potentials, as depicted in 
Fig. XIII-3. Conceptually, the simplest potential that matches QCD to this 
behavior is 


Vir) = br —~ +p, (1.2) 
T 


where a, b, Vo are constants and the color dependence between quark and antiquark 
is that in Eq. (XI-2.4). The Coulomb-like 1/r component is designed to reproduce 
one-gluon exchange at short distance. The confining linear ‘br’ term models a 
color-flux tube of constant energy density, as noted in Sect. XI-2. The coefficient 
b is commonly described in the literature as the string tension, in reference to the 
string model of hadrons, and its value is estimated from a string model relation 
involving the typical slope a’ of a hadronic Regge trajectory (cf. Table XIII-2), 


b = 27a)! ~ 0.18 GeV’. (1.3) 


This is equivalent to a restoring force of about 16 tons! 

In practice, phenomenological studies of quarkonium can be carried out by adopt- 
ing the potential of Eq. (1.2) or another assumed potential energy functions. Exam- 
ples include the following, e.g.,! 


-HF {[4?In (1+ a/d] {A ~ 0.4 GeV}, 


2 
Vir) = br —a/r ee). te) 
apt e GeV | , 


1 The second and third potentials provide fits only up to an additive constant. 
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Table XIII-2. Regge trajectories. 


Trajectory N Slope“ J-intercept 
N 3 0.99 —0.34 
A 3 0.92 0.07 
A 3 0.94 —0.64 
> 3 1.1 =1.2 
x* 2 0.91 —0.24 
T 3 0.72 —0.05 
p 4 0.84 0.54 
K 4 0.69 —0.22 
K* 4 0.86 0.29 


“Tp units of GeV 7? 


where F {. .. } denotes a Fourier transform. The first two of the potentials in Eq. (1.4) 
are commonly called the ‘Richardson’ [Ri 79] and ‘Cornell’ [EIGKLY 80] poten- 
tials, respectively. They are constructed to mimic QCD by exhibiting a linear con- 
fining potential at long distances and single gluon exchange at short distances. 
The Richardson potential even incorporates the asymptotic freedom property for 
the strong interaction coupling. The third is a power-law potential [Ma 81] which, 
although not motivated by QCD, can be of use in analytical work or in obtain- 
ing simple scaling laws. The power-law potential also serves as a reminder of 
how alternative forms can achieve a reasonable success in fitting bb and cé spec- 
tra, which, after all, are primarily sensitive to the limited length scale 0.25 < 
r(fm) < 1. 

From the viewpoint of phenomenology, it is ultimately more useful to appreciate 
the general features of the Q Q static potential than to dwell on the relative virtues 
and shortcomings of individual models. 


Effective field theories: The full theory of QCD is richer than can be captured in a 
single potential function. Gluon degrees of freedom can be dynamically active, and 
field-theoretic corrections introduce subtle modification to masses and couplings. 
Effective field theory techniques provide a modern way of understanding both the 
perturbative and nonperturbative properties of heavy-quark systems. 


There are various scales associated with quarkonium systems. The heavy-quark 
mass sets a hard scale. Degrees of freedom associated with this scale may be treated 
perturbatively. Scales connected to the momentum transfer in the bound state, 
p ~ mwv, are related to the typical spatial extent, (7), of the bound state. The 
time scales involved for quarkonium dynamics are related to the nonrelativistic 
kinetic energy E ~ mv?/2. For large quark mass, the velocity, typically of order 
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v? ~ 0.1 — 0.3, can be treated as a small parameter such that each of these scales 
is technically distinct, with 


mg > mov > mov. (1.5) 


Different versions of effective field theories can be invoked to treat the different 
scales [BrPSV 05]. 

In Non-Relativistic QCD, abbreviated as NRQCD, degrees of freedom of order 
mg are integrated out from the theory [CaL 86]. This leaves the light degrees of 
freedom being the full set of particles of QCD. The gluons (and light quarks) 
are included dynamically, but are treated with an ultraviolet cut-off of order mg 
because their high-momentum components have been integrated out.” The heavy 
quark itself is treated nonrelativistically. Because the hard modes have been inte- 
grated out, there appear higher-order gauge-invariant interactions with Wilson coef- 
ficients that parameterize the strength of the new terms. The effective lagrangian 
then starts out as 


L = Log + Log + Looe (1.6) 


where Log is the usual lagrangian for gluons and Log is the lowest-order lagrangian 
for nonrelativistic quarks 


Lo =v iD, + | y (1.7) 
2mo 


where Do, D give the coupling of the heavy quarks to gluons. To lowest order in 
the both the QCD coupling constant and in the heavy-quark expansion one can set 
the Wilson coefficient C = 1, but perturbative corrections lead to different defini- 
tions of the heavy-quark mass (see Sect. XIV—1) and C can account for matching 
onto these definitions. Operators that are higher order in the 1/mg expansion also 
emerge. Examples are 


c C Coe. 
Looe = Y? f =D 4: > Gg | w+ ouievivste (18) 
Mo mo Mo 


The first two terms here describe higher-order interactions with gluons, while the 
last term is a contact interaction which mimics the effect of a potential. In effective 
field theory, the contact interaction is appropriate because it comes from the higher- 
momentum modes above the scale p ~ mv. The gluonic Coulomb interaction is 
still treated perturbatively. There will be further contact interactions for different 
spin and color combinations. 


2 See the discussion of Sec. IV-7. 
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Because this effective theory includes gluons, there are perturbative corrections 
also to the heavy-quark mass. These are discussed more fully in Sect. XIV—1. For 
the purposes here, we will note that the definition of the quark mass is tied up with 
an overall energy shift in the potential, previously denoted by Vo in Eq. (1.2). Def- 
initions of the mass which are perturbatively well-behaved are those that are tied 
to physical thresholds [HoSSW 98]. Effectively, this absorbs Vo into the definition 
of the quark mass within some specific prescription. Because this prescription may 
vary, the appropriate kinetic energy mass in Eq. (1.7) may have a different value, 
leading to C Æ 1. 

One can go further and integrate out degrees of freedom between p ~ mv and 
E ~ mv’. Since these modes are below the spatial scale of the bound state, 
contact interactions are no longer appropriate, but they must be replaced by a 
spatially dependent potential [PiS 98]. Such an effective field theory is labeled 
DNRQCD, with the ‘p’ referring to the potential. This starts to make closer contact 
with the phenomenological potential models. However, it remains a field theory 
and there are controlled perturbative modifications from the so-called ‘ultra-soft 
modes which remain dynamical at this scale [HoS 03]. 

The effective field theory treatments put many of the early successes of phe- 
nomenological potential models onto a firmer footing. Moreover, they have also 
been successful at helping to connect lattice calculations to the phenomenology of 
quarkonium. 

Lattice studies: Lattice-gauge theory is well suited to the exploration of the 
heavy-quark potential [DeD 10, GaL 10, Ro 12]. In the heavy-quark limit, the 
quarks become static and their interaction energy can be measured by numerical 
methods. Such studies confirm the general picture of a “Coulomb plus linear’ inter- 
action. However, the lattice calculations can also provide the connection between 
the physical values of the parameters to the underlying scale of QCD, Agcp. 

In general, the static interaction can be described by a function 


= d*q igr (a°) 
V(r)= -f em T (1.9) 


At large q, the coefficient a(q) is determined by the perturbative expansion of 
QCD, which has now been accomplished to three-loop order [AnKS 10]. Numeri- 
cal studies must then match on to the perturbative results at short distance, and this 
can be accomplished.? In doing so, the residual interactions can be mapped onto 
the operators of NRQCD and/or the potential of pVRQCD. While the state of the 
art continues to advance, the present connection between theory and experiment in 
the quarkonium spectrum is impressive [Br et al. 11]. 


3 See, e.g., [Le 98] 
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Transitions in quarkonium 


All quarkonium states are unstable. Among the decay mechanisms are annihilation 
processes, hadronic transitions, and radiative transitions. Roughly speaking, the 
lightest quarkonium states are relatively narrow, but those lying above the heavy- 
flavor threshold, defined as twice the mass of the lightest heavy-flavored meson 
and depicted by dashed lines in Figs. XHI—1 and XIN-2, are broader. This pattern 
is particularly apparent for the ° S; states — below the heavy-flavor threshold, widths 
are typically tens of keV, whereas above, they are tens of MeV. The primary reason 
for this difference is that above the heavy-flavor threshold, quarkonium can rapidly 
‘fall apart’ into a pair of heavy-flavored mesons, e.g., Y[45] — BB, whereas 
below, this mode is kinematically forbidden. 

In the following, we shall describe only decays which occur beneath the heavy- 
flavor threshold, and shall limit our discussion to annihilation processes and 
hadronic decays. Radiative electric and magnetic dipole transitions are adequately 
described in quantum mechanics textbooks. 

Annihilation transitions: To motivate a procedure for computing annihilation 
rates in quarkonium, let us consider the simple case of a charged lepton of mass 
m moving nonrelativistically with its antiparticle in a ! Sọ state, and undergoing a 
transition to a two-photon final state.* First, we write down the invariant amplitude 
for the pair annihilation process, 


i 
M = —ie¥(py, à+) Ce F T] u(p-, à-), 
P_- 4) P_—4,—m 
(1.10) 
for momentum eigenstates. In the lepton rest frame, we are free to choose the trans- 
verse gauge €f - p- = €} - p- = 0, ie. €?, = 0. Since *S; states can make no 


contribution to the two-photon mode, we can compute the squared-amplitude for a 
! Sọ transition by summing over initial state spins, 


et 1 (629) 2 
yo IMP? = a TBE + CROI (1.11) 


he @ wi 


where œ 2 are the photon energies in the lepton rest frame. Near threshold, the 
photons emerge back to back, and the differential cross section is found to be 


do a? 2 
= (1- (€}-63)’). 1.12 

dQ 2mv4 ( a i 
Likewise, near threshold, a sum on photon polarizations gives 


4 The! So (GRY 1) States have even (odd) charge conjugation, and can therefore give rise to even (odd) numbers 
of photons in an annihilation process. 
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Fig. XIN—4 Decay of quarkonium through annihilation. 


Da-i er) =2, (1.13) 


and upon integrating over half the solid angle (due to photon indistinguishability) 
we obtain the cross section, 
4a?’ r 
O=- (1.14) 
mM YV, 
This is the transition rate per incident flux of antileptons. Since the flux is just the 
antilepton velocity v times a unit lepton density, we interpret vio as the transition 
rate for a density of one lepton per volume. For a bound state with radial quantum 
number n and wavefunction W,,(x), the density is |W,,(0)|? and the lowest-order 


expression for the electromagnetic decay rate I" ie So] becomes 
4 2 
rem [iso] = v4.6 1%, OF? = —— 1H, OP. (1.15) 
m 


The corresponding rate for yy emission from ! Sọ states of the bb (T) system is 
obtained from Eq. (1.15) by including a factor e} = 1/81, which accounts for the 
b-quark charge, and a color factor of three. Determination of the two-gluon emis- 
sion is found similarly (cf. Fig. XI[I—4(a)) provided the gluons are taken to be 
massless free particles, and is left for a problem at the end of the chapter. Including 
also the effects of QCD radiative corrections, referred to a common renormaliza- 
tion point ur = mp, we have [KwQR 87] 


487 a? |W,, (0) |? a; (mp) 
Pysyy [n' So] = 81(2m,)2 f — 3.4 x | : 
327&?(mp)|Y, (0)? as (mp) 
TY—>gg [1' So] = Om f zr f oe) 


Decays can also occur from the n?S; states. The single-photon intermediate 
state of Fig. XIII-4(b) leads to emission of a lepton pair, whereas Fig. XIII—4(c) 
describes final states consisting of three gluons, two gluons and a photon, or three 


5 There are annihilations from higher partial waves as well. These involve derivatives of the wavefunction at 
the origin. 
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photons. For such a three-particle final state, there are six Feynman diagrams per 
amplitude and three-particle phase space to contend with. Upon including QCD 
radiative corrections, the results are [KwQR 87] 


1677 a7|W,, (0) |? 16 a,(m 
Py, gln*Si] = ae | a 2], 


9(2m,)? 3 x 
40S = HOF f -4o20 |, 
neak s = E =e. od : 
TAM eee Cs T a „OË E n2] (1.17) 


The QCD contributions in Eq. (1.17) are of interest in several respects. They con- 
tribute, on the whole, with rather sizeable coefficients and can substantially affect 
the annihilation rates. Also, they have come to be used as one of several stan- 
dard inputs for phenomenological determinations of œs. To eliminate the model- 
dependent factors |W,,(0)|?, one works with ratios of annihilation rates, 


T'y_,3¢ [nS] 7 5 as (mp) 


Py gey [nS] 4 a (1 _9 s2) 
T kA 


Pysg [n? Si] _ 10 (x? — 9)a3 (mp) (a) (1 4 0.43%") s(m sm), (1.18) 


Py up [nS] 9 TA? 2mp 


In reality, there are a number of theoretical and experimental concerns which make 
the extraction of œ, (mp) a rather more subtle process than it might at first appear: 
(i) the contribution of |Y, (0)|? in Eqs. (1.16), (1.17) as a strictly multiplicative 
factor is a consequence of the nonrelativistic approximation and may be affected by 
relativistic corrections; (ii) there is no assurance that O(a,)” terms are negligible; 
particularly in the light of the large first-order corrections; (iii) experiments see not 
gluons but rather gluon jets, and at the mass scale of the upsilon system, jets are 
not particularly well defined; and (iv) the y spectrum observed in the ygg mode is 
softer than that predicted by perturbative QCD, implying the presence of important 
nonperturbative effects. Nevertheless, determinations of this type lead to the central 
value (and its uncertainties) A = 296 + 10 MeV as extracted from upsilon data 
and cited earlier in Table II-2. This example indicates how demanding a task it is 
to obtain a precise experimental determination of a;(q7). 
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Hadron transitions: The transitions V’ > V + x? and V’ > V + n involving 
the decay of an excited °S, quarkonium level (V’) down to the °S; ground state 
(V) are interesting because they are forbidden in the limits of flavor-SU (2) and 
flavor-SU (3) symmetry, respectively. Their rates are therefore governed by quark 
mass differences, and a ratio of such rates provides a determination of quark mass 
ratios. There is a modest theoretical subtlety in extracting the rates, as degenerate 
perturbation theory must be used [IoS 80]. The leading-order effective lagrangian 
for these P-wave transitions must be linear in the quark mass matrix m, 


Lym = -iz gf Tr (m (U — U*)) #3, Vyda V} 


=c [on = Mu) + (2m; — ma — m) ape | eP Os Vaile Vg, 


T3 
V2 V6 
(1.19) 


where c is a constant. Here, 73 and ng are the pure SU (3) states which appear prior 
to mixing 


z? = cos 13 + sin 0 ng, n = — sin 0 m3 + cos ng, (1.20) 


where tan 0 ~ 0 = J/3(mg — m,)/[22ms — mg — m,)] describes the quark mix- 
ing. Upon calculating the transition amplitudes and then substituting for the small 
mixing angle 0, we obtain 


Mo | 2m, — Md — Mu | 3 Mo 
Myrsyro = = | Ma — mM, + 6) = (mq — Mu), 
Kar aa hae V3 ay" 
Mo | 2ms — ma — m 
Msg eae pE e 
V'—>Vy J d V3 
2M w a u 2 
= °(m, —m) + O (ma = Mu)" (1.21) 
/6 Ms 
where Mo = ic €“"*Pk,,€*k’ eg. The ratio of decay rates is found to be 
r 0 27\|mg—m,|’ p 3 
ge Vr = | Au EE (1.22) 
Tv>vn 16 Ms =m Py 


We can extract a quark mass ratio from charmonium data involving (2S) > 
J/v transitions. From the measured value Q = 0.0396+0.0033 [RPP 12], we find 


Mq — Mu 

—— = 0.0354 + 0.0015, (1.23) 
ms —m 

which is rather larger than the value in Eq. (VII-1.19) extracted from pion and 

kaon masses. 
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XIII-2 Light mesons and baryons 


In the quark model, the light baryons and mesons are Q? and QQ combinations 
of the u, d,s quarks. The resulting spectrum is very rich, containing both orbital 
and radial excitations of the L = 0 ground-state hadrons. For mesons, the Q and 
Q spins couple to the total spins § = 0, 1, and each (L, S) combination occurs 
in the nine flavor configurations of the flavor-SU(3) multiplets 8, 1. Analogous 
statements can be made for baryon states. 

In the face of such complex spectra, we are mainly interested in the regulari- 
ties that allow us to extract the essential physics. A tour through the database in 
[RPP 12] reveals some general patterns.° Both radial and orbital excitations of the 
light hadrons appear 0.5 — 0.7 GeV above the ground states. As pointed out in 
Sect. XI-1, this indicates that the light quarks move relativistically. Other strik- 
ing regularities are (i) the existence of quasi-degenerate supermultiplets of parti- 
cles with differing flavors and equal (or adjoining) spins, and (ii) excitations of a 
given flavor having increasingly large mass (M) and angular momentum (J) val- 
ues, which obey J = a’ M? + Jo. 


SU(6) classification of the light hadrons 


To the extent that the potential is spin-independent and we work in the limit of 
equal u, d, s mass, the quark hamiltonian is invariant under flavor-SU (3) and spin- 
SU (2) transformations. To lowest order, hadrons are thus placed in irreducible 
representations of SU (6), and quarks are assigned to the fundamental representa- 
tion 6, 


6=(utdt st ul dẹ S4) (2.1) 


We can also write the SU (6) quark multiplet in terms of the SU (3) flavor represen- 
tation and the spin multiplicity as 6 = (3, 2). Although the SU (6), invariant limit 
forms a convenient basis for a classification of the meson and baryon states, it can- 
not be a full symmetry of Nature since the spin is a spacetime property of particles 
whereas SU (3) flavor symmetry is not. Thus, it is impossible to unite the flavor 
and spin symmetries in a relativistically invariant manner [CoM 67]. Although we 
shall avoid making detailed predictions based on SU (6), it is nonetheless useful in 
organizing the multitude of observed hadronic levels. 

Meson supermultiplets: The L=0 OO composites are contained in the SU (6) 
group product 6 x 6* = 35 @ 1, where the representations 35, 1 have flavor—spin 
content 


6 Our discussion will focus on hadron masses. Strong and electromagnetic transitions are described in 
[LeOPR 88]. 
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Fig. XII-5 Spectrum of the light mesons. 


35 = (8, 3) @ (8, 1) @ (1, 3), 1= (1,1). (2.2) 


The L = 0 ground state consists of a vector octet, a pseudoscalar octet, a vector 
singlet, and a pseudoscalar singlet. For excited states, the meson supermultiplets 
constitute an SU(6) x O(3) spectrum of particles. The O(3) label refers to how 
the total angular momentum is obtained from J = L + S, giving rise to the pattern 
of rotational excitations displayed previously in Table XI—3. Roughly speaking, 
mesons occur in mass bands having a common degree of radial and/or orbital exci- 
tation. 

Fig. XII-5 provides a view of the mass spectrum for the lightest mesons. The 
SU(6) x O(3) structure of the ground state and a sequence of orbitally excited 
states are observed to the extent that sufficient data are available for particle assign- 
ments to be made. Note that the S-wave QQ states are all accounted for, but gaps 
appear in all higher partial waves. Even after many years of study, meson phe- 
nomenology below 2 GeV is far from complete! 

Baryon supermultiplets: The SU(6) baryon multiplet structure arises from the 
Q? group product (6 x 6) x 6 = (21 ® 15) x 6 = 56 ® 70 ® 70 ® 20, and has 
flavor—spin content 
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Fig. XII-6 The low-lying baryon spectrum. 


56 = (10, 4) @ (8, 2), 
70 = (8,4) & (10, 2) ® (8, 2) @ (1, 2), 
20 = (8, 2) @ (1, 4). (2.3) 


A three-quark system must adhere to the constraint of Fermi statistics. Each baryon- 
state vector is thus antisymmetric under the interchange of any two quarks. A 
Young-tableaux analysis of the above group product reveals that the spin—flavor 
parts of the 56, 70 and 20 multiplets are, respectively, symmetric, mixed, and anti- 
symmetric under interchange of pairs of quarks. Since the color part of any Q? 
color-singlet-state vector is antisymmetric under interchange of any two quarks, 
the 56-plet has a totally symmetric space wavefunction, with zero orbital angular 
momentum between each quark pair. The 70 and 20 multiplets require either radial 
excitations and/or orbital excitations. Recall the characterization of the baryon 
spectrum in terms of the basis defined by an independent pair of oscillators 
(cf. Eq. (XI-2.12)). In this context, a standard notation for a baryon supermulti- 
plet is (R, LẸ), where R labels the SU (6) representation, P is the parity, N labels 
the number of oscillator quanta and L is the orbital angular momentum quantum 
number (cf. Sect. XI-2). 

Like meson masses, baryon masses tend to cluster in bands having a com- 
mon value of N. The first three bands are shown in Fig. XIII-6, and effects of 
SU (6) breaking are displayed for the first two. The lowest-lying SU(6) x O(3) 
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supermultiplet is the positive-parity (56, Of), having content as in Eq. (2.3). Next 
comes the negative-parity (70, 1; ) supermultiplet. This contains more states than 
the 70-plet shown in Eq. (2.3) because the extension from L = 0 to L = 1 requires 
addition of angular momenta, 


(10, 2) —> (10,4) 6 (10,2), (8,4) > (8,6)  &, 4) @ (8, 2), 


(1,2) — 1,4 4 (1, 2), (8, 2) > (8, 4) ® (8, 2). ef 


The number of supermultiplets grows per unit of excitation thereafter. There are 
five SU (6) multiplets in the N = 2 band, (56, 23), (56, 05), (70, 27), (70, 07), and 
(20, 17). Recall that the baryonic inter-quark potential was expressed in Eq. (XI- 
2.10) as V = Voge + U, where Vos. is the potential energy of a harmonic oscillator 
and U = V — Voge. If the potential energy were purely Vosc, the supermultiplets 
within the N = 2 band would all be degenerate. In the potential model, assuming 
that the largest part of U is purely radial, this degeneracy is removed by the first- 
order perturbative effect of U, and the splittings in the N = 2 band are shown at 
the top of Fig. XIII-6. Aside from choosing the (56,03) supermultiplet to have 
the lowest mass, one finds the pattern of splitting to be as shown in Fig. XII-6, 
independent of the particular form of U. 


Regge trajectories 


It is natural to classify together a ground-state hadron and its rotational excita- 
tions, e.g., the isospin one-half positive-parity baryons N (939) ;~1/2 (the nucleon), 
N (1680) j=5/2, N (2220) j=9/2 and N (2700) ;=13/2. Although no higher-spin entries 
have been detected in this particular set of nucleonic states (presumably due to 
experimental limitations), there is no theoretical reason to expect any such sequence 
to end. The database in [RPP 12] contains a number of similar structures, each 
characteristically containing three or four members. 

Each such collection of states is said to belong to a given Regge trajectory. To 
see how this concept arises, let us consider the simplest case of two spinless parti- 
cles with scattering amplitude f(E, z) (i.e. do/dQ = | f(E, z)|*), where E is the 
energy and z = cos 0 is the scattering angle. It turns out that analytic properties of 
the scattering amplitude in the complex angular momentum (J) plane are of inter- 
est. One may obtain a representation of f(£, z) in the complex J-plane by con- 
verting the partial wave expansion into a so-called Watson—Sommerfeld transform, 


f(E,2 =) (-)' 26+ DatE, £) Pe(—2) 


£=0 


1 T 
5 Ọ dJ —— QJ + DalE, J)Ps(—2), (2.5) 
2xi Je sinz J 
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where P; is a Legendre polynomial and C is a contour enclosing the nonnegative 
integers. Suppose that as C is deformed away from the Re J-axis to, say, a line of 
constant Re J, a pole in the partial wave amplitude a(£, J) is encountered. Such 
a singularity is referred to as a Regge pole and contributes (cf. Eq. (2.5)) to the full 
scattering amplitude as 


BLE] Pate\(—z) 7 
sin(za[E]) 

where a[ EF] is the energy-dependent pole position in the complex J-plane and 6[ E] 

is the pole residue. 

The Regge-pole contribution of Eq. (2.6) can manifest itself physically in both 
the direct channel as a resonance and a crossed channel as an exchanged particle. 
Here, we discuss just the former case by demonstrating how a given Regge pole can 
be related to a sequence of rotational excitations. Suppose that at some energy Er, 
the real part of the pole position equals a nonnegative integer £, i.e., Re a[ER] = £. 
Then, with the aid of the identity, 


a ie 1 sin(@) 
= dz P;(z) P»(—z) = =, 2.7 
ae PAEO er Eee oa 
we can infer from Eq. (2.6) the Breit-Wigner resonance form, 
z 1 T/2 
S = Pag) 
eiel|E|=—{)@[E] +£+1) B= Er+ir/2 


provided Re a[Er] > Im a[ Er]. A physical resonance thus appears if a[ E] passes 
near a nonnegative integer and, if the Regge pole moves to ever-increasing J values 
in the complex J-plane as the energy E is increased, it generates a tower of high- 
spin states. Except in instances of so-called exchange degeneracy, parity dictates 
that there be two units of angular momentum between members of a given trajec- 
tory. In this manner, a single Regge pole in the angular-momentum plane gives rise 
to the collection of physical states called a Regge trajectory. 

A plot of the angular momentum vs. squared-mass for the states on any meson 
or baryon trajectory reveals the linear behavior, 


J >a M? + J. (2.9) 


A compilation of slopes (œ’) and intercepts (Jo) appears in Table XIII—2, with each 
trajectory labeled by its ground-state hadron. Such linearly rising trajectories have 
been interpreted as a consequence of QCD [JoT 76]. In this picture, hadrons under- 
going highly excited rotational motion come to approach color-flux tubes, where- 
upon it becomes possible to relate the angular momentum of rotation to the energy 
contained in the color field. This line of reasoning leads to the behavior of Eq. (2.9), 
and accounts for the universality seen in the slope values displayed in Table XIII-2. 
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SU(6) breaking effects 


Although an SU (6)-invariant hamiltonian provides a convenient basis for describ- 
ing light hadrons, the physical spectrum exhibits substantial departures from the 
mass degeneracies which occur in this overly symmetric picture. In the following, 
we shall consider some simple models for explaining the many SU (6)-breaking 
effects observed in the real world. 

The QCD Breit-Fermi model: If one ascribes the nonconfining part of the quark 
interaction to single-gluon exchange, the nonrelativistic limit yields the ‘QCD 
Breit—Fermi potential’ [DeGG 75] 


Aka, 
Vone-gluon == 3y 
4kg 87 T 1 1 
2 229-5" _§3 ee R 
+ MM jr [3(s; -F)(S; -Ê -S; -s;] 
P S; ¿r x pi Sj: r xpi Sj: rX pj —S;-r xX pj 
r3 2M? 2M; MiM; 
<p deep pot 2.10 
+e Mer (pi - py +F@- pi) p)| (2.10) 


where œs is the strong fine structure constant, r = r;j, and k denotes the color 
dependence of the potential (cf. Sect. XI-2) with k = 1 (1/2) for mesons (baryons). 
In keeping with the potential model, the mass parameters {M;} are interpreted 
as constituent quark masses. Although the QCD Breit-Fermi model incorporates 
SU (6) breaking by means of both quark mass splittings and spin-dependent inter- 
actions, it lacks a rigorous theoretical foundation. One might argue on the grounds 
of asymptotic freedom that Eq. (2.10) does justice to physics at very short distances 
(in the approximation that œ, is constant), but there is no reason to believe that it 
suffices at intermediate-length scales. It also does not account for mixing between 
isoscalar mesons, so such states must be considered separately. 

Meson masses: The gluon-exchange model can be used to obtain information 
on constituent quark mass. In the following, we shall temporarily ignore the minor 
effect of isospin breaking by working with M = (M, + Mo) /2. To compute meson 
masses, we take the expectation value of the full hamiltonian between SU (6) eigen- 
states, specifically the L = 0 QQ states.’ Although the form of Eq. (2.10) implies 
the presence of spin-spin, spin-orbit, and tensor interactions, the spin—orbit and 


7 An analysis of spin dependence in the L = | states is the subject of a problem at the end of the chapter 
(cf. Prob. XIII-3)). 
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tensor terms do not contribute here because each quark pair moves in an S wave, 
and it is the spin-spin (hyperfine) interaction which lifts the vector meson states 
relative to the pseudosclar mesons. We can parameterize the nonisoscalar L = 0 
meson masses as 
: ‘ (pp) (Po) 
m` -= SAM NG ou T amg eee Sgh (2.11) 
where ñ and n, are the number of nonstrange (n) and strange consituents (s) respec- 
tively, and Hog refers to the hyperfine interaction in the second line of Eq. (2.10). 
One consequence of Eq. (2.11) is a relation involving the mass ratio M /M;. 
Fitting the four masses x (138), K (496), o(770), K*(892) to the parameters in 
Eq. (2.11) yields 
= Uns M 
MEETME 2 T a ee}. (2.12) 


Mp — My Han M; 


The origin of this result lies in the inverse dependence of the hyperfine interaction 
upon constituent quark mass, which affects the mass splitting between S = 1 and 
S = 0 mesons differently for strange and nonstrange mesons. The numerical value 
of M /M, in Eq. (2.12) graphically demonstrates the difference between constituent 
quark masses and current quark masses, the latter having a mass ratio of about 
0.04. In earlier sections of this book, which stressed the role of chiral symmetry, 
the pion was given a special status as a quasi-Goldstone particle. In the Q Q model, 
the small pion mass is seen to be a consequence of severe cancelation between the 
spin-independent and spin-dependent contributions. However, the parameterization 
of Eq. (2.11) cannot explain the large 7'(960) mass. 

In addition to the SU (6) symmetry-breaking effects of mass and spin, there is an 
additive contribution present in the isoscalar channel, which is induced by quark- 
antiquark annihilation into gluons. In the basis of u, d,s quark flavor states, this 
annihilation process produces a3 x 3 mass matrix of the form 


2M, + X X X 
X 2Maı + X X ; (2.13) 
X X 2M; +X 


where for C = +1(—1) mesons, X is the two-gluon (three-gluon) annihilation 
amplitude, and for simplicity we display just the quark mass contribution (2M;) 
as the nonmixing mass contribution. The annihilation process is a short-range phe- 
nomenon, so the magnitude of X depends sharply on the orbital angular momentum 
L of the OO system. For L Æ 0 waves (where the wavefunction vanishes at zero 
relative separation), and C = —1 channels (where the annihilation amplitude is 
suppressed by the three powers of gluon coupling), we expect M, — M > X.Inthis 
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limit, diagonalization of Eq. (2.13) yields to leading order the set of basis states 
(uu + dd)//2 and 5s. Only the L = 0 pseudoscalar channel experiences opposite 
limit X > M, — M , wherein to leading order the basis vectors are the SU (3) sin- 
glet state (uu +dd +5s)/V/3 and octet states (uu — dd)/V2, (au+dd— 255)//6. 
The overall picture that emerges is one of relatively unmixed light pseudoscalar 
states, and heavily mixed vector, tensor, etc., states. 

Baryon masses: Applying the one-gluon exchange potential to the ground-state 
baryons of (56, 05) yields a mass formula analogous to Eq. (2.11), 


mE = AM + nM, + y et 


isl i<j 


Ji (2.14) 


For the system of 1/2* and 3/2" (iospin-averaged) baryons, there are eight mass 
values and since the above mass formula contains five parameters, one should 
obtain three relations. The additional perturbative assumption Hss — Hns = Has — 
Hm for the hyperfine mass parameters yields the Gell-Mann—Okubo relation of 
Eq. (XII-3.10) for the 1/2" baryons and the equal spacing rule for 3/2* states, 


Myx — Ma = Max — Myx = MQ — Ms. 


(Expt. 153 MeV = 149 MeV = 139 MeV) (2.15) 


A third relation which relates the 3/2* and 1/2+ masses and is independent of 
further perturbative assumptions has the form 


3m a =Ma = 2my = 2(my» = ma) 
(Expt. : 276 MeV = 305 MeV) (2.16) 


In addition, one can obtain estimates for M /M;, among them 


M 2(my* — my) 
M, ~ ms. + my — 3m, 
M — 


~ 0.62, 


Har 20,65, (2.17) 


M; ma—my 


both in accord with Eq. (2.12). 

Isospin-breaking effects: The above description of SU (6) breaking assumes 
isospin conservation. In fact, hadrons exhibit small mass splittings within isospin 
multiplets, arising from electromagnetism and the u — d mass difference. In the 
pion and kaon systems, we were able to use chiral SU (3) symmetry to isolate each 
of these separately. Unfortunately, this is not possible in general, and models are 
required to address this issue. 
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There are a few consequences which follow purely from symmetry considera- 
tions. Since the mass difference m, — mq is AJ = 1, the AJ = 2 combinations 


my+ + my- — 2myo = 1.7 £0.1 MeV, m+ — mp = —0.3 + 2.2 MeV, 
(2.18) 


arise only from the electromagnetic interaction. In addition, both electromagnetic 
and quark mass contributions satisfy the Coleman—Glashow relation [CoG 64], 


My+ —My- +m, —m, +mg- —mgo =0 


[Expt. 0.4 + 0.6 MeV = 0]. (2.19) 


For electromagnetism, this is a consequence of the U-spin-singlet character of the 
current, whereas for quark masses it follows from the AJ = 1 and SU(3)-octet 
character of the current. 

We proceed further by using a simple model, based on the QED Coulomb and 
hyperfine effects, to describe the electromagnetic interaction of quarks, 


Atco = Acoul > Qi Qj, 


i<j 


Amnyp = —Abyp 3 aT Ag ‘Sj, (2.20) 
iMj 


where Acou, Anyp are constants, {Q;} are quark electric charges, and the sums are 
taken over constituent quarks. In Ampyy, we shall neglect further isospin break- 
ing in the masses and use M, = My = M , and assume electromagnetic self- 
interactions of a quark to be already accounted for in the mass parameter of that 
quark. For any values of Agou and Anyp, this model contains the sum rule 


1 
(mn — Mp)em = =g UES + my- — 2ms0) = —0.57 + 0.03 MeV, (2.21) 


leaving the excess due to the quark mass difference, 

m,—m 
2 

(ma — My)(dn + fm)Zo 

= (Mn — Mp) — (Mn — Mp)em = 1.86 + 0.03 MeV, (2.22) 


d a = My — Mq 2 = 
- (n|uu — dd|n) — DE (pluu — dd| p) 


(Mn — Mp)qm = 


where the second line in the above uses the parameterization of hyperon mass split- 
tings given in Eq. (XII-3.9). To the extent that this estimate of quark mass differ- 
ences is meaningful, one obtains the mass ratio, 

Mq — My (my — Mp )qm 


= ~ 0.015, (2.23) 


ms — m mz — my 


o 
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to be compared to the chiral-symmetry extraction from meson masses, which 
yielded 0.023. With further neglect of terms O(@(M, — M)) in the hyperfine inter- 
action, this exercise can be repeated for vector mesons to yield 


2 
(mgo — MK+)em = —3 Mot — mp) = 0.2 + 1.5 MeV, 


(mgo = MK*+)qm = (M go = mg+) = (m gso = M g+ Jem 


= 6.5 +1.9 MeV, 
ee a = 00s 0G. (2.24) 
mMm, — M mg» — mp 


The additional assumption that the constants Acou! and Anyp are the same in the 
decuplet baryons and the octet baryons, as is true in the SU (6) limit, leads to 


5 
(Ma++ — Mpo)em = qini + my- — 2ms0) = 2.8 + 0.2 MeV, 


(Mat — Mao)qm = (Mat++ — Mao) — (Mat — M ao)em 


— —5,540.4 MeV, 
= i 1 — 

eS o A = 01S 0000. (2.25) 

ms; — m 2 myx — ma 


Of course, the spread of values for the mass ratios raises a concern about the valid- 
ity of this simple model. However, all methods of calculation agree on the small- 
ness of the ratio (mg — m,)/(m; — M). 


XIII-3 The heavy-quark limit 


In the quark description, a heavy-flavored hadron contains at least one of the heavy 
quarks c, b, t. An effective field theory, Heavy Quark Effective Theory (HQET), 
has been developed which provides a powerful tool for heavy quark physics. This 
involves a study of the limit (mg — oo) in which the theory is expanded in powers 
of mọ. We describe a simple introduction to the topic and much more detail can 
be found in [MaW 07]. 


Heavy-flavored hadrons in the quark model 


The spectroscopy of heavy-flavored hadrons should qualitatively follow that of the 
light hadronic spectrum, with states containing a single heavy-quark Q occurring 
as either mesons (Qq) or baryons (Qqıq2). The lowest-energy state for a given 
hadronic flavor will have zero orbital angular momentum between the quarks, 
leading to ground-state spin values § = 0,1 for mesons and § = 1/2, 3/2 for 
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Charmed mesons Charmed baryons 


Fig. XIII-7 Spectrum of charmed (a) mesons, (b) baryons. 


baryons. The hyperfine interaction will lower the S$ = 0 meson and S = 1/2 baryon 
masses, and both orbital and radial hadronic excitations of the ground state will be 
present. 

Although it is possible to contemplate extended flavor transformations which 
involve interchange of the light and heavy quarks, e.g., as in the SU (4) of the 
light and charmed hadrons, such symmetries are so badly broken by the difference 
in energy scales Mg >> M, and Mg > Agcop as to be rendered useless. The 
SU (3)- and SU (2)-flavor symmetries associated with the light hadrons are still 
viable, but multiplet patterns become modified. The mesons Qq will exist in the 
SU (3) multiplet 3*, whereas in the baryonic Qqıq2 configurations the two light 
quarks q1, q2 will form the flavor-SU (3) multiplets 6 and 3*. For example, the 
charmed system has the meson ground state 


3*: D*[cd], D len], D" [c5], 


which displays the mass pattern of an SU (2) doublet (Dié Dhe) and an SU (2) 
singlet (Djo¢9). The charmed-baryon multiplets are 


6: Ui*[uuc], Ut [udc], Uddc)], Ef [usc], EO [dsc], Q°[ssc] 


3*: At[udc], Bi*[usc], E% [dsc]. 


Fig. XIII-7 displays the anticipated charmed-meson and charmed-baryon levels, 
including the effect of SU (3) breaking. 

Heavy-quark constituent mass values can be inferred from the D*— D and B*— B 
hyperfine splittings. That the former splitting is about three times the latter is a 
consequence of M, ~ 3M, and of the inverse dependence of the hyperfine effect 
upon quark mass. Analogously to Eq. (2.17), we find 


M = M $ = 
EE GO. = TE 608: 3.1) 


Me Mp — My M, Mp — My 
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where M = (M,, + M,)/2. These findings depend to some extent on how the 
fit is done, e.g., with mesons or with baryons, and we leave further study for 
Prob. XIIH+. 


Spectroscopy in the mg — œ limit 


In a hadron which contains a single heavy quark Q along with light degrees of 
freedom, the heavy quark is essentially static. The best analogy is with atoms, 
where the nucleus can in the first approximation be treated as a static, electrically 
charged source. Likewise, for heavy hadrons the heavy quark is a static source 
with color charge, and the light degrees of freedom provide a nonstatic hadronic 
environment around Q. This scenario can be formalized by partitioning the heavy- 
quark lagrangian as [CaL 86, Ei 88, LeT 88] 


Lo = Y (ip = mg) y= Lo + L space 
Lo=W(iyDo—mo)v, — Lspace = —i Yy - Dy, (3.2) 


where D,, is the covariant derivative of SU(3),. Since the spatial y matrices 
connect upper and lower components, we see that the effect of Lspace iS O(m’). 
Observe that the static lagrangian Lo of Eq. (3.2) is invariant under spin rotations 
of the heavy quark Q. In the world defined by Lo, with both O(A gcp/ Mo) effects 
and O(a;(MQ)) effects (associated with hard-gluon exchange) ignored, heavy- 
hadronic energy levels and couplings are constrained by the SU (2) spin symmetry. 
It is helpful to visualize the situation. A heavy-flavored hadron of spin S will con- 
tain a static quark Q having a constant spin vector Sg (with Sg = 1/2) and light 
degrees of freedom having a constant angular momentum vector Je = S — So.8 
For a meson of this type, we assume that J; behaves as it does in the quark model, 
with Je = 1/2 in the ground state and J; = L + 1/2 for L > 0 rotational excita- 
tions. From the decoupling of the heavy-quark spin, it follows that there will be a 
two-fold degeneracy between mesons having spin values S = J + 1/2. The meson 
L = 0 ground state will have Jẹ = 1/2 and thus degenerate states with S = 0, 1. 


The L = 1 first rotational excitation with J; = 1/2 will give rise to degener- 
ate S = 0,1 levels, whereas for Je = 3/2 one obtains degenerate levels having 
S = 1,2. Moreover, the energy differences between different levels should be 


independent of heavy-quark flavor. Analogous conditions hold for heavy flavored 
baryons, and hadronic transitions between levels of differing L can be similarly 
analyzed. 


8 Although the light degree(s) of freedom in the simple quark model is an antiquark g for mesons and two 
quarks q1q2 for baryons, the physical (i.e. actual) light degrees of freedom could entail unlimited numbers of 
gluons and/or quark—antiquark pairs. 
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Let us explicitly demonstrate that the splitting between the J? = 17 and J? = 
07 states of a Qq meson must vanish in the limit of infinite quark mass. We note 
that the mathematical condition for spin-independence is 


[ Ho, s$] =ü; (3.3) 


where s? is the generator of spin rotations about the 3-axis for quark Q and Hb is 
the hamiltonian obtained from Lo. Since the action of s£ on a O` state produces a 
17 state, i.e., |Mi-) = 28,°|Mo-), we then have 


Ho|Mi-) = m-|M,-) = 283° Ho|Mo-) = mo-|M1-) , (3.4) 


implying that mı- — mo- —> 0 as mo — ov. 

Another consequence of working in the static limit of £o is that the propagator, 
Soo(x, y), of the heavy quark in an external field can be determined exactly. From 
the defining equations, 


(iyoDo — mg) Soo(x, y) = 8 (x — y) (Do = ðo + ig3A9-4/2), (68.5) 


one has the solution 


aate 1 
Sæl, y) = —i P (xo, ¥o)8O (X= y) po — y? imoa") ( zr) 


mae 1— 
+ 0g? = x)eimat?—y") (=) , (3.6) 


where P (xo, yo) is the path-ordered exponential along the time direction, 


XO 
Po yo) = Pexp ef dt h- Ao(x, D|; o7 
y? 


In this approximation, the heavy quark is static at point x and the only time- 
dependence is that of a phase. 

This discussion can be generalized to a frame where the heavy quark is moving at 
a fixed velocity v, described by a velocity-four vector v“ = p“/mg, with v“v, = 1. 
One can define projection operators 


Pa = 5 ED) (3.8) 


where ie = Iv, Tolos = 0, andl,,+T,_ = 1. The [y+ generalize the usual 
projection of ‘upper’ and ‘lower’ components into the moving frame. A quark mov- 
ing with velocity v will have the leading description of its wavefunction contained 


in the ‘upper’ component described by a field h, [Ge 90, Wi 91], 


Pyp = e™ h, (x), (3.9) 
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where the main dependence on the quark mass has been factored out, and h, obvi- 
ously satisfies [,,h, = hy. Substituting into the Dirac lagrangian, neglecting lower 
components, and using r,r. = v- D yields 


Lo = 4 (iP — mo) Y > Wl os (iP — me) Tv4 = hiv - Dhy, (3.10) 


which generates the lowest-order equation of motion v - Dh, = 0. This approx- 
imation can be systematically improved by inclusion of a ‘lower’ component for 
the heavy-quark field [EiH 90, Lu 90, GeGW 90], 


T -y = e"o (x), (3.11) 


with [,_£, = l. The equations of motion allow us to solve for £, by following 
the sequence of steps, 


0= (iP - mg) Y = (iP — me) e™™2”* [hy + £] 
= e`imovx (mo (¢-1)+ ie '"o"* 1p) [hy + £0] 
= e imoux [(-2mg + iD)é, + iDh,| F (3.12) 


which yields £, and y as 


i =z 
t, = ae +0 (mọ) 
y =e mort |] 4 `y h, +0 (mọ). (3.13) 
2mo Q 
Inserting these forms into Eq. (3.10) and using [,,h, = h, and Eq. (IM-—3.50) for 
PP yields 


jip- 2 -26-|, 
mo 4mo í 
(v- D)? 
Im 


=h jiv-D- = (p pe +1 ito!” Fa) |" (3.14) 
v m H 4 §3 uv vs : 
which is the desired expansion in terms of the heavy-quark mass. Because the last 
term in this expression is second order in v- D and noting that v: Dh, = 0 to lowest 
order, it will not contribute to matrix elements at order 1/mg and can be dropped. 
The lagrangian of Eq. (3.14) corresponds to a quark moving at fixed velocity. Anti- 
quark solutions can be constructed with the mass dependence e*!"2”*, with the 

result 


_ by 


A 2m 


Tey 


i . 1 u 1 a uv pa 
=k, LAES D,D” + 78340 Fi, ky, (3.15) 
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where the field k, satisfies [,_k, = ky. It is legitimate to neglect the production 
of heavy QQ pairs. However, one should superpose the lagrangians for different 
velocities in a Lorentz-invariant fashion, 


3 
pS [as ô (v,v" — 1) (v0) k + £p] = J = |z? $ rl . 618 
0 

The nature of the approximation at this stage is more of a classical limit rather 
than a nonrelativistic limit. To be sure, for any given quark one can work in the 
quark’s rest frame, in which case the quark will be nonrelativistic. However, when 
external currents act on the fields, transitions from one frame to another occur for 
which Av is not small. On the other hand, the result can be said to be classical 
because quantum corrections have not yet been included and these can renormalize 
the coefficients in LÌ 2 Also, diagrams involving the exchange of hard gluons can 
produce nonstatic intermediate states. Such corrections can be accounted for in 
perturbation theory [Wi 91]. 


XII-4 Nonconventional hadron states 


Many suggestions have been made regarding the possibility of hadronic states 
beyond those predicted by the simple quark model of QQ and Q? configurations. 
The study of such states is hampered by the fact that we still have very little idea 
why the quark model works. QCD at low energy is a strongly interacting field 
theory, and we would expect a very rich and complicated description of hadronic 
structure. That the result should be describable in terms of a simple QO and Q? 
picture as even a first approximation remains a mystery. Quark models have been 
popular because they seem to work phenomenologically, not because they are a 
controlled approximation to QCD. This weakness becomes all the more evident 
when one tries to generalize quark model ideas to new areas. 

Much of the theoretical work on nonconventional states has involved the con- 
cept of a constituent gluon G, analogous to a constituent quark Q, and we shall 
cast our discussion with respect to this degree of freedom.’ It is clear that there 
should be a cost in energy to excite a constituent gluon. The energy should not 
be extremely large, else it would be difficult to understand the early onset of 
scaling in deep-inelastic scattering. However, it cannot be less than the uncer- 
tainty principle bound on a massless particle confined to a radius R ~ 1 fm of 
E = p2V3/R x~ 342 MeV (cf. Sect. XI-1). Model calculations have tended to 
use a somewhat larger effective gluon ‘mass’. 


9 However, it should be understood that such a concept has not been shown to follow rigorously from QCD, nor 
indeed is a configuration of definite numbers of consitituent gluons a gauge-invariant entity (cf. Sect. X—2). 
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Table XIII-3. Gauge-invariant color-singlet 


interpolating fields. 
Operator Dimension gre 
qrq 3 Of a a 
qr Dq 4 PRE ae 
FF 4 OFE 2E 0T 2 
qrq F 5 ae a a a a 
FDF 5 ei 


The basic idea of confinement is that only color-singlet states exist as physical 
hadrons. If we identify those states which are color singlets and which contain few 
quark or gluon quanta, we can easily find other possible configurations besides Q Q 
and Q?. Some of the more well-known examples are 


(1) Gluonia (or glueballs) — quarkless G? or G? states, which we shall discuss in 
more detail below, 

(2) Hybrids — color-singlet mixtures of constituent quarks and gluons like Q QG 
mesons or Q°G baryons, 

(3) Dibaryons — six-quark configurations in which the quarks have similar spatial 
wavefunctions rather than two separate three-quark clusters, 

(4) Meson molecules — loosely bound deuteron-like composites of mesons. 

(5) Tetraquark states — strongly bound states with quark structures gqqq. 


A convenient framework for describing the quantum numbers of possible 
hadronic states is obtained by considering gauge-invariant, color-singlet operators 
of low dimension [JaJR 86], as was discussed in Sect. XI-1. Table XIII-3 lists all 
such operators up to dimension five which can be constructed from quark fields, 
QCD covariant derivatives, and the gluon field strength, denoted respectively by 
q, Dq, DF, and F. Also appearing in Table XIII-3 is the collection of J?© quan- 
tum numbers associated with each such operator. Particular spin-parity values are 
obtained from these operators by choosing indices in appropriate combinations. 


The first resonance — o (440) 


The lightest resonance encountered in the meson spectrum has long been one of 
the most controversial states. This state is officially known as f (500), but it is 
almost universally referred to as o. The existence of this resonance has finally 
been established unambiguously. However, the interpretation remains remarkably 
subtle. 
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The scattering of two pions in the Z = 0 and J = 0 channel becomes strong at 
low energies. The amplitude is described by chiral perturbation theory, as described 
in Sect. VII-3. At first order in the energy expansion, the scattering amplitude is!° 


S 


(0) __ 
Too = to = 16r F2 


(4.1) 
This amplitude is purely real, while under the general principle of unitarity of the 
S matrix the elastic amplitude must have the form 


Too = e!500 sin 6005 (4.2) 
and has to satisfy 
Im Too = | Tool’. (4.3) 


The lowest-order amplitude of Eq. (4.1) has no imaginary part. However, in chiral 
perturbation theory, the imaginary part starts at order E+, and the first contribu- 
tion to this appears through one-loop diagrams. Chiral perturbation theory satisfies 
unitarity order by order in the energy expansion. 

The o appears as a resonance when exact unitarity is applied to the scattering 
amplitude. The pole can be seen in an exceptionally simple approximation. If one 
simply iterates the lowest-order amplitude one can produce a fully unitary result 


to 
1— ito 


Too = (4.4) 
which satisfies Eq. (4.3) exactly and also reproduces the chiral result to first order. 
The use of Eq. (4.1) with a complex value for s as the input for Eq. (4.4) produces 
a pole on the second sheet at 


Js = (1 —i)V87 F, = (460 — i460) MeV. (4.5) 


This is the first approximation to the o. 

The complete analysis is much more subtle, but carries a similar result. By 
including not only unitarity, but also crossing symmetry and analyticity, one can 
obtain a dispersive representation of the scattering amplitude [Ro 71]. When eval- 
uated using chiral constraints at low energy and data at high energy, the mz data 
can be fully described [CoGL 01]. When extended into the complex plane, the real 
o pole appears at [CaCL 06] 


Ts 
VS = ms — T = (441 — i272) MeV. (4.6) 


10 Th order to keep the formulas simple and physically transparent in this introductory section, we present them 
with the pion mass set equal to zero. 
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However, this does not appear as a typical resonance. In contrast to others, the o 
width is larger than its mass, indicating that the pole is far from the physical values 
of s. Moreover, in the scattering amplitude itself, there is no sign of a resonant 
bump. The phase shift rises almost linearly from 599 = O at threshold to d99 = 
100° around 900 MeV. The phase shift does go through 90°, traditionally a sign 
of a resonance in elastic scattering, but at an energy ./s ~ 850 MeV which is far 
removed from the pole position. These unusual features had long created confusion 
about the existence of the ø, which has been cleared up only through the rigorous 
combination of chiral and dispersive techniques. 

The o is a dynamical strong-coupling resonance. The resonance does not fit nat- 
urally into the quark model and it does not seem profitable to try to force the o 
into that framework. While we do expect to see quark model bound states as reso- 
nances, there is no requirement that all resonant behavior must be associated with 
quark model states. Indeed, there is a strong theoretical argument that the o is dif- 
ferent from the bound states of QCD [Pe 04]. Recall two features of the large Ne 
limit discussed in Chap. X — that the meson bound states stay constant in mass 
when the large N, limit is taken, but scattering amplitudes fall like 1/N,. This lat- 
ter requirement is satisfied for the xm amplitudes; in the lowest-order amplitude 
of Eq. (4.1), the amplitude falls with Ne because Fy ~ ~Ne appears squared in 
the denominator. Because the zz amplitude is smaller at larger Ne, the amplti- 
tude becomes of order unity at a higher energy. If the o is indeed connected with 
the strong coupling of zz scattering, its mass will shift to higher energy as Ne 
increases. While we cannot change JN, in the scattering data themselves, there are 
straightforward analytic methods, such as the inverse-amplitude method [DoP 97], 
which is a variant of Padé techniques,'! to closely describe the data including chiral 
perturbation theory and exact unitarity. Use of such techniques is able to reproduce 
the o found in the data, and then when JN, is varied one finds [Pe 04], 


Mo ~ V Ne, (4.7) 


as expected by the general argument. Indeed, even our simplified approximation of 
Eq. (4.5) has this behavior, again due to F;, ~ ~N.. Because the bound states of 
QCD should behave as a constant, m ~ N°, the o appears distinct from these. It 
appears to be a resonance associated with the unitarity of elastic scattering.'” 

Some caveats and cautions about this result are appropriate. This experimen- 
tal resonance does not appear to be the o of the linear ø model. As described in 
Chaps. IV and VII, the coefficients of the chiral lagrangian are sensitive to the 
underlying fundamental theory, and the coefficients found for QCD do not resem- 
1) Our approximation of Eq. (4.5) above is equivalent to the lowest order of the inverse amplitude method. 


12 Other states that may have a related origin include the « (800) seen in Kz scattering and the N(1405) in x N 
scattering. 
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ble those of the linear o model. Nor is the existence of this state a justification to use 
a fundamental o field in field-theoretic calculations. While the use of ø exchange 
with a particular coupling may be a proxy for mz effects in a given reaction, this 
use is not necessarily valid in general. The use of a fundamental o is much more 
restrictive than the variety of pionic effects. Moreover, it is neither an accurate nor 
controlled approximation, and may double-count the pionic contributions, which 
must also be included. 

In addition, the above discussion provides a cautionary counterexample to a 
widely used argument. It is common to use the violation of tree unitarity of an 
effective theory as an indication of the energy at which New Physics should be 
seen [LeQT 77], with the expectation that the New Physics would restore unitar- 
ity. In the situation discussed above, the usual measure of tree-unitarity violation, 
Re Too < 1/2, occurs at 460 MeV, which is well below the production threshold of 
the quarks and gluons of QCD. Also, the energy of tree-unitarity violation varies 
as ./N. in units where the scale of QCD is held fixed [AyAD 12]. Thus, any ‘New 
Physics’ does not have the same N. scaling. The strongly coupled effective theory 
manages to respect unitarity without new degrees of freedom. The situation above 
indicates that, while the violation of tree unitarity does indicate the existence of 
a strongly coupled region, its use as an indicator of New Physics must be treated 
with caution. 


Gluonia 


The existence of a gluon degree of freedom in hadrons is beyond dispute, with 
evidence from deep-inelastic lepton scattering and jet structure in hadron—hadron 
collisions. However, trying to predict the properties of a new class of hadrons 
whose primary ingredient is gluonic is nontrivial. Hypothetically, if quarks could 
be removed from QCD the resulting hadron spectrum would consist only of gluo- 
nia (or ‘glueballs’ ). 

Gluonic configurations should be signaled by the existence of extra states beyond 
the expected nonets of QQ hadrons. However, mixing with QQ hadrons is gen- 
erally possible (cf. Sect. X—2). Although predicted by the 1/N,. expansion to be 
suppressed, such mixing effects serve to cloud the interpretation of data vis-a-vis 
gluonium states. Referring to the interpolating fields mentioned above, we see that 
for gluons the gauge-invariant combinations 

Ro peN, Fe Fi Fa F”, Fi Fay (4.8) 


av? Huv 
can be formed out of two factors of a gluon field-strength tensor F$, or its dual 


Fow., The spin, parity, and charge conjugation carried by these these operators 
are respectively JPC = 0+, 2+*, 07+, 27+, and are thus the quantum numbers 
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expected for the lightest glueballs,!? i.e., such operators acting on the vacuum state 
produce states with these quantum numbers. Although there is no a priori guaran- 
tee that one obtains a single particle state (e.g., a 2** operator could in principle 
create two 0t* glueballs in a D wave), the simplicity of the operators leads one to 
suspect that this will be the case. There is one, somewhat controversial, construct 
missing from the above list. Two massive spin-one particles in an S wave can have 
JPC = 17* as well as J?© = 0+, 2**, and some models predict such a gluonium 
state. However, a 1~* combination of two massless on-shell vector particles is for- 
bidden by a combination of gauge invariance plus rotational symmetry [Ya 50]. 
The lack of a 17* gauge-invariant, two-field operator is an indication of this. 

Aside from a list of quantum numbers and some guidance as to relative mass 
values, theory does not provide a very clear profile of gluonium phenomenology. 
Lattice-gauge methods offer the best hope for future progress. Present quenched 
lattice studies predict that in a quarkless version of QCD the lightest glueball is a 
O+ state of mass 1.7 + 0.1 GeV and while the 2+* and 0~* glueballs are about 
1.4 + 0.1 times heavier [Ba et al. 93], [MoP 99], [Ch et al. 06]. 

The challenge arises when couplings to quark degrees of freedom are intro- 
duced, in which case substantial mixing between quark and gluonium states must 
occur. Lattice studies of the mixing with the 0** state have yielded mixed results, 
some indicating a lowering of the mass by as much as several hundred MeV 
[Ha et al. 06], while others show little effect [Ri et al. 10]. It is generally agreed 
that inclusion of quarks has little effect on the mass of the 2*+ and 0~* states 
[Ri et al. 10], [HaT 02]. The problem has also been studied via QCD sum rules 
with inclusion of instanton effects, but again there exists considerable uncertainty 
[Fo 05], [Ha et al. 11]. 

Gluonium states would be classified as flavor-SU (3) singlets and if mixing with 
quark states exist there should exist ‘extra’ such states. An example of this phe- 
nomenon exists in the 1.5 GeV region where the states 


fo(1370), fo(1500), fo(1710), K% (1430), ao(1450) 


can be interpreted as a nonet of qq states plus a glueball [AmC 96]. In this picture 
the three fo states are mixtures of the O+ glueball and the two qq states from 
the nonet. The validity of this description relies on the existence of these three fo 
resonances. While the fo(1500) and fo(1710) are reasonably well established and 
have significant two-meson decay channels, the same is not true of the fo(1370), 
which, if it does exist, has a large (>80%) decay fraction into 47. For this 
reason the interpretation in terms of three-channel mixing of these states is still 


3 Gluonic operators with three field-strength tensors produce states with J? C = ott, 14+, 24+, ++, 2+-, 
3~~ . Because of the extra gluon field, one expects these states to be somewhat heavier: 
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Table XIII-4. Spectroscopy of six-quark 
configurations. 


SU (6) of color-spin SU (3) of flavor Spin 


490 1 0 
896 8 1,2 
280 10 1 
175 10* 1,3 
189 27 0,2 
35 35 1 

1 28 0 


controversial. Thus, despite 30 years of work on the problem of glueballs, the situ- 
ation remains confused. A recent review of the subject can be found in [Oc 13]. 


Additional nonconventional states 


There is a widespread belief that gluonium states must appear in the spectrum of the 
QCD hamiltonian, though as discussed above it has proved challenging to identify 
them. For other kinds of nonconventional configurations, it is also difficult to reach 
a meaningful consensus, although experimental efforts to detect such states are 
ongoing. We briefly review several such possibilities. 


(i) Hybrids: From Table XIII-3, we see that among the QQG meson hybrids is 
one with the quantum numbers JPC = 17+. This would-be hadron is of particular 
interest because comparison with Table XI-3 reveals that it cannot be a Q Q config- 
uration. Model calculations suggest that the lightest such state should be isovector, 
with mass in the range 1.5—2.0 GeV, and that such states may largely decouple 
from L = 0 QQ meson final states. A study of Q°G baryon hybrids reveals that 
none of the states is exotic in the sense of lying outside the usual Q? spectrum 
[GoHK 83]. 


(ii) Dibaryons: The most remarkable aspect learned yet about the dibaryon states 
is how much six-quark configurations are restricted by Fermi—Dirac statistics. 
Table XIII-4 lists the possible six-quark SU (3) multiplets along with their spin 
values [Ja 77]. Of this collection of states, the most attention has been given to 
the spinless SU (3)-singlet state, called the H-dibaryon. This particle, which has 
strangeness S = —2 and isospin J = 0, is predicted to be the lightest dibaryon, 
and if bound would to be unstable to weak decay. A series of experiments has failed 
to find the H, so at this time there is no evidence for the existence of dibaryons. 


(iii) Hadronic molecules and tetraquarks: Particles with the quark content gqqq 
also form color singlets. The literature distinguishes two types of such states: 
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hadronic molecules and tetraquarks. Roughly speaking, the molecular states refer 
to two separate gq color-singlet states that are lightly bound. Since the binding 
energy is small, such states could be expected to be found right near the threshold 
for the two mesons. Tetraquarks refer to configurations where the gqqq constituents 
are more compactly intertwined, with the details of the configuration varying in 
different models. Clearly, there can be a continuum interpolating between these 
extremes. We will not enter into the debate about the signals for the two classes of 
four-quark states. 

There appears to be clear evidence for the existence of a state in this category. 
The Z.(3900) [Li et al. 13] [Ab et al. 13] has mass and production properties that 
indicate that it contains a cc pair. However, it also carries a charge which proves 
that it also contains light quarks with the ud combination producing the positive 
charge. The internal configuration has not been sorted out yet. 

Among the particles that have been discussed as molecules are the isovector 
ao(980) and isoscalar fo(975) mesons. Nominally, these particles have the quan- 
tum numbers of the L = 1 sector of the Q Q model, and their near equality in mass 
suggests an internal composition similar to that of the o(770) and (783), i.e., 
orthogonal configurations of nonstrange quark—antiquark pairs. However, among 
properties which argue against this are their relatively strong coupling to modes 
which contain strange quarks, their narrower-than-expected widths, and their yy 
couplings. The proximity of the KK threshold and the importance of the K K 
modes has motivated their interpretation as K K molecules [Wel 83]. However, 
interpretation of scattering data near the 1 GeV region is not clear, and indeed a 
strong case has been made for the alternative gqqq picture [’tHoIMPR 08] and for 
heavier states as well. 

A clearer situation is provided by the X(3872), which has been interpreted in 
terms of a Do- D} hadronic molecule, which is bound by z° exchange at long 
distance and quark/color exchange at short distances. That X(3872) is not a sim- 
ple charmonium state is indicated by large isospin violation seen in the data. This 
occurs in the molecule interpretation because the mass of the resonance is essen- 
tially identical to mpo + mpos and considerably lighter than m p+ + mp-«. Thus, 
the molecular state would predominantly involve Do-Dé containing ccuu quarks, 
so that this structure is a mixture of isospin states 


cuü = cet [jea + dd) + [oa = ad) (4.9) 


In this picture there should be nearly comparable decays to final states with Z = 0 
and J = 1, and this is indeed indicated by significant branching ratios of the X (3872) 
to both J/wp and J Yœ modes. 
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Other examples of four-quark states may occur in the bb system and the reso- 
nances X+ (10610) and X7 (10650), which appear to be a bound states of B+-B” 
and B°*—B**, respectively. In this case the states are charged, with quark content 
bbud, so that both states are clearly exotic-they cannot be excited bottomonium. 

The overall interpretation of these states is complicated by the fact that molecules 
and tetraquarks have the same quark content and are distinguished only by details 
of their internal configuration. In some cases, both interpretations have advocates 
[AIHW 12, Du et al. 10]. 


Problems 


(1) Power-law potential in quarkonium 

Consider an interquark potential of the form V(r) = cr“. 

(a) Use the virial theorem to determine (7)/(V) for the ground state. 

(b) Given the form E25— Eis = f (d)M~4/@*®, where M is the reduced mass, 
determine d from the observed mass differences in the c¢ and bb systems, 
using Eq. (3.1) to supply heavy-quark mass values. 

(c) Assuming this model is used to fit the spin-averaged ground-state cc and 
bb mass values, determine v7 /c? for each system. 

(2) Quarkonium annihilation from the !Sọ state 
Modify Eq. (1.15) to obtain the leading-order contributions appearing in 
Eq. (1.16). 

(3) Mass relations involving heavy quarks 

(1) Repeat the analysis of Eq. (3.1) but using the masses of the charmed/strange 
mesons D,, Dž instead. Infer a value for M/M; by referring to the result 
obtained in Eq. (2.17). Compare with the determination of Eq. (3.1). 

(2) Extend the procedure of Eqs. (2.20-2.25) to isospin-violating mass differ- 
ences of c-flavored and b-flavored hadrons. 


XIV 


Weak interactions of heavy quarks 


Heavy quarks provide a valuable guide to the study of weak interactions. Mea- 
surements of decay lifetimes and of semileptonic decay spectra of heavy, flavored 
mesons! yield information on individual elements of the CKM matrix, as does the 
observation of heavy-meson particle—antiparticle transitions such as By—By mix- 
ing. Long anticipated data involving detection of CP-violating signals have been 
found to be in accord with expectations of the Standard Model and have played a 
crucial role in constraining the sole complex phase in the CKM matrix. 


XIV-1 Heavy-quark mass 


At the level of the Standard Model lagrangian, the six quark masses are equivalent; 
they are all just input parameters that must each be determined experimentally. In 
the real world of particle phenomenology, quark mass divides into two sectors, 
‘light’ (u, d,s) and ‘heavy’ (c, b, t). It is a hallmark of light-quark spectroscopy 
that hadron mass is not a direct reflection of quark mass. However, for hadrons 
which contain a heavy quark, the energy scale is set by the mass of the heavy quark. 
In the following, we discuss topics of special relevance to heavy-quark mass. 


Running quark mass 


Heretofore we have described the renormalization of quark mass in terms of the 
mass shift ôm = m — mo, where mo is the bare mass. We can also, for convenience, 
employ a multiplicative mass renormalization constant Z,, with mo = Z,,m. In 
minimal subtraction, Z„ will have an €-expansion, 


1 Note that in the conventions of the Particle Data Group the quantum numbers of the neutral mesons are 
K? = (d5), D? = (cit), B? = (db) and B® = (sb). 
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4 — Zinn (0's) as 1 
Zm (a, € )=lt DE lOt (1.1) 


n=1 


Both m = m(n) and Z,,(j2) will depend implicitly on a scale jz, but not the bare 
mass mo. A sequence of steps follows from this simple observation, 


d 
= Znmm with os = 
din 
dm mw) dZm 
dnu Zn dnu Ym (e (u))m(u), 
1 aZ P B 
= — m = (0) “s_ (1) Ms a ii 
n Zm dinw Ym An + Ym (=) + y ( ) 


where Ym is called the anomalous dimension of the quark mass operator. Since 
there is no explicit dependence in Z,,, on either quark mass m or a renormalization 
scale u, the anomalous dimension ym is the same in any minimally subtracted 
regularization scheme, such as MS. 

Let us determine the leading coefficient y”. From Eq. (1.2) we have? 


dZ iy dg dZm 
— 29 E 


ZnYnE) = = ; 1.3 
Ym(8) din Tnu dg? (1.9) 
To proceed, we shall require an extension to € Æ 0 of Eq. (II-2.57b), 
dg g 
— == z= — — — ... Z — eee 1.4 
Aaa = B(g(u), €) = —eg Pore + Se ae (1.4) 


where we recall that By = 11 — 2n f/3 > 0. We then obtain from Eq. (1.3), 


dZm 1 
(+---) (yO +--+) = 2g eg ++) ( mila.) (1.5) 
or, finally, the desired result 
y® = 60:3) = 8. (1.6) 


At this point, we have a differential equation whose integration gives the scale 
dependence of the quark mass, 


d 
HA ed. 
m(u) 
dln dg 
dinp = =, 
ee dg “8 Boa) 
gL) , 
ma) = m (Ho) ex | - J dg! 2o], (1.7) 
g(no) B(g') 


2 For notational simplicity, we suppress the subscript in g3 and use instead g. 
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where f(e) is the beta function of Eq. (II-2.57b). This equation is ordinarily used 
for situations for which QCD perturbation theory is applicable (i.e. short distances). 
Here, we consider the leading-order expressions, with 


g% =O% 1.8 
B Boz 8 Ym = Yn Fr’ (1.8) 


the insertion of which into Eq. (1.7) yields 


Os (4) 
Qs (uo) 


We hasten to note that the concept of a running quark mass is valid for all six fla- 


ym /2Bo 
| (1.9) 


m(u) = m(no) | 


vors, not just heavy quarks. For heavy quarks, it has become standard to express the 
MS mass in the form m (m), i.e., to refer to the scale u = m which equals the MS 
mass itself. This is convenient because any experimental determination m (Hexpt) 
can always be ‘run’ to the scale u = m. A compilation of various phenomenologi- 
cal inputs yields [RPP 12] 


mie(me) = 1.275 + 0.025 GeV,  mip(ip) = 4.18 + 0.03 GeV. (1.10) 


Equation (1.9) represents the leading-order expression for the running mass. 
Extensive work on higher-order corrections has been carried out, to the level of 
four loops [Ch 97, VeLR 97]. An accessible recipe for a running mass at four loops 
is given by 


c(as(u)) 


; 1.11 
c(as(Ho)) aie 


m(u) = m(no): 


where? a(n) = æs(u)/m. In the above, the argument of the function c(a,(j)) 
requires a running strong-coupling a, (jz) also evaluated at four-loop order, but this 
has been addressed earlier in Eqs. (II-2.77), (II-2.78). Useful numerical forms of 
c(x) are given in [Ch 97] for each of the s, c, b, and t quarks. For example, we 
shall refer in Chap. XV to the b-quark version, 


cp(x) = x??? (1 + 1.17549 x + 1.50071 x? + 0.172478 x°) . (1.12) 


This can be applied to run the b-quark mass from the scale uo = m, (mp) to y = 
Mp, where My is the mass of the Higgs boson. We find m,(My) ~ 0.665 m, (mp). 


The pole mass of a quark 


Since quarks do not exist as free particles, it should perhaps not be surprising that 
different theoretical definitions of quark mass appear in the literature. In the above, 


3 Note this is not the same as the quantity as appearing in Eq. (II-2.76). 
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we have discussed quark mass as it is defined in the MS renormalization scheme. 
Another definition, the pole mass, is simply the renormalized quark mass in on- 
shell renormalization. As an example where use of pole mass seems natural, con- 
sider the top quark. Top-quark mass is measured ‘directly’ in collider experiments, 
primarily via the production of tt pairs. The t quarks will each decay as t > W*b, 
which ultimately gives rise to lepton + jet, dilepton, and all-jet final states. The 
top mass obtained by fitting invariant mass distributions of final-state particles 
has been interpreted as a pole mass, with recent Tevatron and LHC evaluations 
[Mu 12] 


M = eee +0.94 GeV [Tevatron] (1.13) 


173.3 + 1.4 GeV [LHC]. 


There is also an ‘indirect’ way of determining top mass by performing a global 
fit of Standard Model observables in which the top quark contributes as a virtual 
particle. 

An interesting theoretical issue is the relation between pole mass and MS mass. 
This has been carried out in QCD perturbation theory as far as three-loop order 
[MeR 00]. In the following we shall review this process to leading order in a,. We 
begin with the inverse renormalized quark propagator, expressed as 


Splen (P) = PBren(p, m) — MAren (p°, M), (1.14 


where the functions Aren and Ben are calculated in QCD perturbation theory, with 
m being the MS mass. We must seek a zero in Bi snk p) for the on-shell conditions 


of øy = M and p? = M? with M being the pole mass. Following [FIJTV 99], we 
have for the O (œs) renormalized propagator amplitudes in the on-shell limit, 


Ao-s = Aten 


as 
= 14+ 208) (248) +++: 
p=M, p2=M?2 4x 


Bo-s = Bren 


=1+ 2208) +, (1.15) 
j=M, =m? 4r 

where & is the gauge parameter and C2(3) is given below Eq. (II-2.12). We can 
now obtain the desired relation between pole mass M and MS mass 7 in terms of 
the MS coupling @,. The condition for a zero, 0 = mA y-; — M Bo-s, implies the 
relation, 


— Ages a;(M) 
M =m =mi +C +], (1.16) 


where we exhibit scale dependence in m or a, as it would appear in a more gen- 
eral treatment. Notice that the explicit gauge dependence has canceled, as it must. 
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For the top quark, a comparison between MS mass and pole mass at NNLO level 
in the QCD perturbation theory gives 


m,(m;) = 163.342.7GeV, M, =173.3+2.8GeV, (1.17) 


as inferred from Tevatron data [AIDM 12]. 

Actually, it would appear that the very concept of pole mass for a quark is para- 
doxical because, after all, quarks are not free particles, and it is, in fact, the case 
that due to confinement the exact nonperturbative quark propagator will not have a 
pole. The pole mass exists as a creature of perturbation theory and phenomenology. 
There is, however, a price to pay for this convenience. Calculation has shown that 
there will be higher orders which grow factorially in the perturbation expansion 
[BeB 95, BiSUV 94]. Because of this, the pole mass itself cannot be determined to 
an accuracy better than the confinement scale Agcp. Other definitions of the mass 
parameter include the 1S mass, defined as one-half the energy of the 15 QQ state 
[HoLM 99], and the kinetic mass, defined via a threshold in weak decay [BiSUV 
94]. Because these include the effects of confinement, they turn out to be better 
behaved in many perturbative calculations [EIL 02]. Indeed, even for the top quark 
the 1S mass is prefered for a proper theoretical description of the tt production 
cross section near threshold [HoT 99]. 

Our lack of understanding of the large magnitude of the top mass illustrates how 
little we actually know about the mechanism of mass generation. If all fermion 
masses arise from the Yukawa interaction of a single Higgs doublet, then the 
Yukawa coupling constants must vary by the factor g;/g. = m;/me ~ 3 x 10°. 
There is nothing inconsistent about such a variation, but it is so striking as to beg 
for a logical explanation, one which is presently lacking. 


XIV-2 Inclusive decays 


Heavy quarks decay to a large number of final states, often containing many parti- 
cles. As the mass of the heavy quark gets larger, it makes increasing sense to treat 
the final states inclusively. We discuss this approach in this section. 


The spectator model 


Consider the weak beta decay, Q — qeve, of an isolated heavy quark Q into a 
lighter quark g. By analogy with muon decay, this proceeds with decay rate (if 
radiative corrections are ignored) 

2 


pmo 2 
T9273 Vao| f(mq/mọ), 


f(x) = 1 — 8x? + 8x — x8 — 24x4 lnx (2.1) 


l o>qëve = 
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where f(x) is the phase-space factor already encountered in our discussion 
of muon decay in Sect. V—2 and of tau decay in Sect. V—3. Under what 
circumstances would this be a good representation for the beta decay of a heavy- 
meson-containing quark Q? For it to be accurate, the final state must develop 
independently of the other (so-called spectator) quark in the heavy meson. Expe- 
rience with deep-inelastic scattering suggests that this occurs when the recoiling 
quark q carries energy and momentum larger than typical hadronic scales, i.e., in 
the range E, > 1-1.5 GeV. For D decays, the average light-quark energy is 
(Eq) ~ mp/3 ~ 0.5 GeV, in which case this approximation is suspect. It should 
be considerably better in B decays, but still not perfect. 

Let us explore the consequences of adopting the spectator model for D and 
B decays. If we neglect CKM-suppressed modes, the main decay channels for b 
quarks are b > cud, ccs, clv; (£L = e, u, T), while for c quarks they are restricted 
toc > sdu, s [LVp, Seve. Relative to the lepton modes, each hadronic decay chan- 
nel picks up an additional factor of 3 upon summing over the final-state colors. 
Two of the B-meson final states (ccs and ctv) have significant phase-space sup- 
pressions (reducing them to about 20% of the cud mode) due to the heavy masses 
involved. The simplest spectator model then predicts branching ratios 


1 

Bip ee — = 0.2, 
AR a 

1 


Bie ~0.17 2.2 
Pasen 2 TD) 24 02 A) 


where X denotes a sum over the remaining final-state particles. Also, this picture 
predicts the absolute rates of the D and B decays to be 


1 
t= [so Dar a | ~ 1.1 x 107" s, (2.3a) 
Gm? D 2 [0.041 |? 
Tg = [seers 2 [Væ]? i x 1.8 x 107s Tm (2.3b) 


where f(x.) = 0.7 and f(x) ~ 0.5 are phase-space factors. For definiteness, we 
have taken me = 1.5 MeV and m, = 4.9 GeV in the above. However, note the 
quintic dependence on quark mass; the B-lifetime prediction would be 10% lower 
if mp = 5.0 GeV were used! 

For D decays, the D* and D° lifetimes differ by a factor of about 2.5, 


Tp+ = (10.40 + 0.07) x 107} s, Tp+/Tpo = 2.54 + 0.02, (2.4) 


whereas the spectator model requires them to be equal. This failure is not sur- 
prising, as the D-meson mass lies in the region of strong hadronic resonances; 
final-state interactions can seriously disturb the spectator picture. Thus, we expect 
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the spectator model to reveal only gross features of the D system. It is remark- 
able, given its simplicity, that the spectator model predicts (roughly) the correct 
magnitudes of the lifetime and of the inclusive branching ratios, 


Brposei,.x = (6.49 £0.11)%, Brp+—ses,x = (16.07 + 0.30)%. (2.5) 


We see that the decays of the D* correspond more closely to the spectator predic- 
tions than do those of the D°. The D°-hadronic decay modes are notably greater 
that the expectation of the spectator model. 

Even for B mesons, the spectator model provides only a rough guide. The life- 
times of the different-flavored B mesons are reasonably similar 


Tgo = (1.519 + 0.0.007) x 107!” s, 


TBt+ 


= 1.079 + 0.007, 
Tpo 
TBO 
TB? _ 0.986 + 0.011, (2.6) 
TpBo 


and the spectator estimate differs from these by less than 20%. However, the 
spectator prediction for the leptonic branching ratio is about 60% larger than the 
experimental value 


Brpses,x = (10.72 0.13) %, (2.7) 


where the number quoted corresponds to roughly an equal mixture of B+ and B°. 
The shorter lifetime and lower leptonic branching ratio point to a modest enhance- 
ment of the hadronic modes. 


The heavy-quark expansion 


The spectator model can be transformed into a solid QCD calculation through the 
use of the operator-product expansion (OPE) [ChGG 90, Ne 05]. This allows the 
inclusion of perturbative and nonperturbative corrections. 

Using the B meson as our example, the treatment starts by considering the cur- 
rent matrix element, squared and summed over all final states, 


Wop = (21)* $ 84 (Ps — q — Px)(BQv)|JZ|X)(X|Jg|B@)), (2.8) 
X 


where q is the momentum carried by the current. The total decay rate is obtained 
by combining Wog with the squared lepton current matrix element L°®, 


Le? = 4 (pip! + pe pt — g” pe- py + ie "è Dey pua) , (2.9) 


and integrating over phase space. 
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Fig. XIV—1 The leading contribution to the heavy-quark expansion. 


The on-shell tensor Wy, is given by the discontinuity in the full tensor 


Tag = —i J d’x e™1* (B(v)|T (Ji (x) J (0))|B(v)), (2.10) 


related by Wag = —zIm Tyg. The discontinuity is evaluated at the physical cut, 
which extends over the region 


1 
mp q? <mpgvu-q < 503 +q? -= mî), (2.11) 


where m/ is the lightest hadron for the final-state quark qj, i.e., mz for qj = u or 
mp for q/ = c. In this formalism, the spectator calculation arises from the evalua- 
tion of the diagram in Fig. XIV—1 using the free intermediate-state propagator. For 
a current gab = q Ya (1 + ys)b, the tensor Tyg becomes 


Tap = -i f ats e!9* (B(v) |b(x) Tg Sr (x) gb(0)| B(v)) 


= BON - Mang ` by” + ys)blBW)), (2.12) 


=m; tie 
where Maug = 8au Pp + &BuPa — 8ap Pu — iEapuv p” With p” = mv” — q" being 
the momentum carried by the intermediate propagator. The only nonzero matrix 
element for a B hadron at rest is (B|by?b|B) = 1. In this case, the amplitude is 
equivalent to the free decay of a b quark. 

However, one can do better because the short-distance behavior of the full ten- 
sor can be described by an OPE. Because the heavy b quark carries a high energy 
and transfers that energy to the intermediate states, the region of validity of the 
OPE is somewhat different than our previous discussion for the weak hamiltonian 
[ChGG 90]. As v - q approaches the upper range given in Eq. (2.11), the over- 
all hadronic mass becomes smaller and enters the region where binding becomes 
important and perturbation theory fails. 

There are two key improvements that can be accomplished by this method. One 
is the addition of perturbative corrections. Included in this process is the ability to 
connect the b-quark mass to a perturbatively well-defined definition of that mass. 
This tames the strong m} dependence found in the spectator model by relating the 
b mass to a well-defined observable. In practice, mass definitions which are tied 
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to measurements that already include confinement effects, such as the 1S mass or 
the kinetic mass mentioned in Sect. XIV—1, provide the most stable perturbative 
definition [BiSUV 94, HoLM 99]. The other path of improvement is to include 
new operators that describe nonperturbative hadronic matrix elements [BiSUV 93, 
Ma 94, MaW 94]. These new operators enter in an expansion in the inverse of the 
heavy-quark mass. The leading operators are those discussed for the heavy-quark 
expansion in the preceding chapter. We can see how these arise by expanding the 
tensor 7,, around the heavy-quark limit including interactions. The interactions 
can be seen in the full propagator 


1 1 
X — |0 = (xX +m a 
( ET ) ( |\D D prye oW Fa, — mq + i€ 


S) = 10}, 


(2.13) 


where M contains the full covariant derivative including the gauge potential and we 
have used Eq. (III—3.50) in obtaining the second form. When the matrix element 
is taken, the derivative turns into D” = (mv — q") + d” where d” contains the 
residual momenta and the gauge field. The result is an OPE of the form 


aß ap a 
4 - = -If os 
T(J (x) J®(0)) = ct’ bb + 2 bGD)2b + Sebo Ftb. Q14) 
m; m, 2 ” 


To leading order in 1/mp, the result can then be expressed in terms of the two 
matrix elements 


2 pre 2 2 _ zit’ apv 
uz = (B(v)|b@D)"b|B(v)), ug = (Bv)|b 5 OyyF'b|B(v)). (2.15) 


The overall inclusive result has the form [BeBMU 03] 
G2 S 2 

resda = SEW y E (=) (1 + O(a,)) (1 E ae 
mo m 


19273 
2\4 2 
-2(1- =) He |. (2.16) 
m m; 


The gluonic operator also appears in the description of the spectroscopy of heavy 
quarks, as described in Chap. XIII, and its value can be estimated from the mass 
splittings in heavy hadrons. The kinetic operator can be fit as part of the energy 
distribution of semileptonic B decay in a combined fit with the total decay rate. 
The perturbative corrections depend on which definition of the renormalized mass 
is employed. Further refinements include the perturbative scaling of the coefficients 
of m; and the addition of 1/m} effects. 
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Inclusive measurements can be used to extract the CKM elements [RPP 12], 


Va = (41.88 + 0.44 + 0.59) x 107°, Va» = (4.41 + 0.15 + 0.16) x 107°. 
(2.17) 


The top quark 


The top quark is the real heavyweight of the quarks and presents a rather novel 
decay pattern. Because m; > Mw + m, and the CKM element | Vip] is near unity, 
the dominant decay is the semiweak transition t > b + W*. The amplitude and 
transition rate for this process are 


8 — 
Mi-sow+ = mi p VEROT)" + ys)u(p:), 
Grm? 3 M? Mi 
Ty sowt = Va 1 =3 Œ os 2.18 
t—>bW+ an JB tb | | ep mi (2.18) 


where we have neglected the b quark mass in the decay rate. The question of which 
definition of m, to use can be answered only when including QCD radiative cor- 
rections, and the convergence of the perturbative series is best when using a short- 
distance definition of the mass, such as the MS mass, rather than the pole mass 
[BeB 95]. QCD corrections including gluon radiation have now been carried out 
to second order in a, [CzM 99, ChHSS 99]. Including these, the top width is [BeE 
et al. 00] 


Tr, = 1.42 GeV, (2.19) 


corresponding to a lifetime of t = 4.6 x 107” s. For such a large t-quark mass, the 
emitted WT bosons are predominantly longitudinally polarized, exceeding produc- 
tion of transversely polarized W* bosons by a factor ~ m?/M;j,. This is a reflec- 
tion of the large Yukawa coupling of the ¢ quark to the (unphysical) charged Higgs 
scalar, which becomes the longitudinal component of the Wt. Other decay modes 
of the ¢ quark will be highly suppressed by weak mixing factors, e.g., for the mode 
t — s + W` the suppression amounts to |Vis/ Vol? ~ 1.6 x 1073. 

An interesting consequence of the large t —> b + W* quark decay rate is that 
there will not be sufficient time for the top quark to form bound-state hadrons. 
In view of the large top-quark mass, the tf system (toponium) is nonrelativistic 
and sits in an effectively Coulombic potential, V = —4a,/3r. In the ground state, 
one finds the quark velocity Vms = 4a,/3 and atomic radius ro = 3/(2a;m,). 
A characteristic orbital period is then T = 2779/Uyms = 97/ (4a2m t). Using 
æs(ro) = 0.12, we estimate T = 19 x 107% s. In contrast, the toponium life- 
time would be one-half the ¢ lifetime given above, since either t or t could decay 
first. These comparisons imply that the top quark has an appreciable probability of 
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decaying before completion of even a single bound-state orbit. An equivalent indi- 
cation of the same effect is the observation that the toponium weak decay width 
(twice that of a single top quark) is larger than the spacing between energy levels, 
such as E25 — Eis = a?m,/3 ~ 0.9 GeV. The production cross section does not 
then occur through sharp resonances. Instead, there exists a rather broad and weak 
threshold enhancement, due to the attractive nature of the Coulombic potential. 
This permits the production and decay of top quarks to be analyzed perturbatively, 
with T, serving as the infrared cut-off. A heavy top quark can then provide a new 
laboratory for perturbative QCD studies. 


XIV-3 Exclusive decays in the heavy-quark limit 


The spectator model calculates the decay rates as if the final-state quarks were free. 
However, the actual decays take place to physical hadronic final states. For the 
total rate, there is absolutely no hope of reliably calculating and summing all the 
individual nonleptonic decays. For semileptonic decays, the situation is somewhat 
better. The data show that the quasi-one-hadron states, i.e., D —> Kev, K*ev. and 
B — Deve, D*eve, form the largest component of the semileptonic rates, 


I D+ Kēve+K*ēve I B+ Dei, +D*eig 


= 0.89 + 0.03, = 0.74 + 0.05. (3.1) 
T Dt xév, T B+ Xeen 

These transitions can be addressed by quark model calculations, so that we have 

an independent handle on such decays. The hadronic-current matrix elements are 


described by form factors such as 
(K~(p') |5yuc| DP) = f+ (p +p’), + f-(P- P’'), 
(K*(p') |5yuc| D°(p)) = igewase” (p + P')" (p - p')’, 
(K=) |šyuyse| PO) = fier te alh Am], 6D 
with analogous definitions for the B decays. All form factors are functions of the 


four-momentum transfer q? = (p— p')?. The physics underlying these form factors 
is two-fold: 


(1) If the final-state meson does not recoil, the amplitude is determined by an 
overlap of the quark wavefunctions, as described in Sect. XII-2. 

(2) As the final-state meson recoils, the wavefunction overlap becomes smaller, so 
that the form factors fall off with increasing recoil momentum. 


For D decays, the CKM element is known to a high degree of accuracy from the 
unitarity of the CKM matrix. In this case, lattice or quark model calculations serve 
to check whether the experimental rate can be reproduced. For B decays involving 
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the b — c transition, the exclusive rates are treated using Heavy Quark Effective 
Theory, which we will describe below. 

In the case of nonleptonic B, D decays, we have considerably less confidence in 
our ability to predict the decay amplitudes. This is especially true in D nonleptonic 
decay because the rescattering corrections required by unitarity can play a major 
role. Unitarity predicts (cf. Eq. (C—-3.14)) for the D —> f matrix element of the 
transition operator, 


iT =T io =). Te, Trap. (3.3) 


where n are the physically allowed intermediate states. The scattering matrix ele- 
ments are evaluated at the mass of the D, which happens to lie in an energy range 
where many strong resonances lie. The scattering elements 7,-, are therefore 
expected to be of order unity, implying that rescattering can mask the underlying 
pattern of weak matrix elements. This makes calculation of nonleptonic D decays 
particularly suspect. 


Inclusive vs. exclusive models for b > cev, 


Inclusive and exclusive techniques appear conceptually quite different, even if we 
know that the total inclusive rate is made from a sum of exclusive individual modes. 
However, the following observation [ShV 88] is instructive for connecting the two 
methods. 

Consider the semileptonic decay of a heavy quark into another heavy quark, 
Qa — Qpeve, such that their mass difference Am is small compared to the average 
of their masses ((m, + m,)/2 >> Am), yet large compared to the QCD scale 
(Am > Agcp). Because of the second condition, one might use the spectator 
model result, 


~ 


G3 (ma z mp) 
P6406. = —— a 


a Va (3.4) 


where Va, is the appropriate weak-mixing matrix element. However, if the first con- 
dition is satisfied, the quark recoil will be nonrelativistic. This leads to a nonrela- 
tivistic calculation of the transitions from a pseudoscalar Qaq state to pseudoscalar 
and to vector Qpq states. In this limit, Yp yoa > vy Wa is proportional to the nor- 
malization operator, while the axial current Ypy; Ys Ya > vy} 0; Ya is proportional 
to the spin operator. For states normalized as 


((QaG)y |(QaG)y ) = 2M 8 (p — p’), (3.5) 
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one then has 
((QnMy [Wevowa] whe =2m, 
CODE |WovirsWa| (Q0 ) = 2m ej (p), (3.6) 


where m is either ma or mp. This translates into invariant form factors 
(Qs6Dpy |VoYu Va| (QaDp ) = (P + p’), 
(Qo) |WovnrsWa| (QaG)p ) = 2m €$ p’), (3.7) 


which are the correct relativistic results. Using these to calculate the semileptonic 
decays, one finds 
GF 5 2 
P(Qai)y->(QnDo- ee = gy3 Ma — Moy Val, 
O Se Gey ar 3.8) 
(QaQ)y- > (Qbq),-eVe — 073 a b abl + : 


Comparing these, one sees that the sum of the pseudoscalar and vector widths 
exactly saturates the spectator result of Eq. (3.4). In this combined set of limits, it 
seems that both types of calculations can be valid simultaneously. Direct applica- 
tion of this insight to b + cev, decays is somewhat marginal, as the nonrelativistic 
condition is not well satisfied. A velocity as large as v = 0.8c is reached in portions 
of the decay region, although on the average a lower value is obtained. However, it 
is likely that the near equality of spectator versus quark model results is a remnant 
of the situation described above. 


Heavy Quark Effective Theory and exclusive decays 


The discussion of the previous section leaned heavily on the use of models to 
describe quark weak decay. However, many aspects of weak transitions can be 
obtained in a model-independent fashion through the use of the mg — oo limit, 
which was introduced in Sect. XIII-3. This effective theory provides a variety of 
qualitative and quantitative insights of considerable value. 

The heavy-quark approximation manages to justify many results which have 
become part of the standard lore of quark models. For example, consider the decay 
constant of a Qq pseudoscalar meson M, 


(0|g(x)y"ysQ(x)| M(p)) = iV2 Fu p” e?*. (3.9) 


In the quark model one finds that Fy « (my)~'/*. This follows from the normal- 
ization of momentum eigenstates, 


(M(p’)|M(p)) = 2Ep8® (p — p’), (3.10) 
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such that 


üns = Na Fymy (decay const defn.), (3.11) 
i/2my W(0)./2N. (quark model reln.), 
where w(Q) is the Qq wavefunction at the origin and N, is the number of colors. 
Since, as mg — ov, the Qq reduced mass approaches the constant value u > mq, 
we expect that y (0) itself approaches a constant in this limit,* and the scaling 
behavior Fy « (my)~'/” then follows immediately from Eq. (3.3). Alternatively, 
the dependence of Fy on my can be derived using the wavepacket formalism 
introduced in Chap. XII. 
This quark model result can be validated in the heavy-quark limit [Ei 88]. 
Consider the contribution of meson M to the correlation function 


C(t) = fas (0|Ao(t, x) 450) 0), (3.12) 


where Ao = Gyoys Q. Inserting a complete set of intermediate states and isolating 
the contribution of meson M, we have 


d°*p 
C(t) = dx OJA x)| M (p)) (M (p) |Ao(0)| 0) +--- 


where the ellipses denote other intermediate states. From the definition of Fy, one 
finds 


F2 2 , 
C(t) = MM mim g... (3.14) 
2mm 


Alternatively, the heavy quark develops in time in this correlation function accord- 
ing to the static propagator of Eq. (XIII-3.6), 


C(t) = — $ eo" (01G(t, 0)yoys P (t, O)(1 + ys)yoys4 (0)10), (3.15) 
2 


with all the dynamics being contained in the light degrees of freedom. The matrix 
element is independent of my, and the scaling behavior, 


Fy x (my), (3.16) 


follows immediately. This technique is applicable to lattice theoretic calculations 
of Fy. There, one considers euclidean (t —> —it) correlation functions, and iden- 
tifies the M contribution by the e™™™" behavior. At present, lattice calculations 
attempting to obtain physical results from the mg — oo limit and from the light- 
quark limit do not agree in regions of overlap. We thus feel it is premature to quote 


4 For example, in the nonrelativistic potential model, the S-wave wavefunction at the origin is related to the 
reduced mass by DAO = uldV /dr) /2xh2. 
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theoretical values of Fp, Fg. Another piece of quark model lore which can be jus- 
tified by this correlation function is that the mass difference my — mg approaches 
a constant value in the mg — oo limit. This can be inferred by comparing the 
exponential time dependences in Eq. (3.14) and Eq. (3.15), and noting that the 
difference must be be independent of the heavy quark. 

The heavy-quark limit also makes predictions [IsW 89] for transition form fac- 
tors between two heavy quarks (which for definiteness we shall call b and c). Recall 
the lagrangian developed in Eq. (XIII—3.15), the leading term of which is 


Ly =hOiv- DAO +hOiv- Dh, (3.17) 


This lagrangian exhibits an SU(2)-flavor symmetry involving rotation of h© 
and h‘). It is also spin-independent, and thus contains an additional SU (2)-spin 
symmetry. The two SU (2)s may be combined to form an SU (4) flavor—spin invari- 
ance. Physically, the internal structure of hadrons containing a heavy quark and 
moving at a common velocity is seen to become independent of the quark flavor 
and spin. This property leads to many relations between transition amplitudes. 

An example of a process appropriate for the heavy-quark technique is the weak 
semileptonic transition B — D induced by a vector current. For a static matrix 
element (i.e., both B and D at rest), the weak current transforms quark flavor 
b — c, but leaves the remaining contents unchanged, resulting in unit wavefunc- 
tion overlap. This can be seen calculationally by noting that the time component of 
the spatially integrated current is the conserved charge of the SU (2)-flavor group 
mentioned above, 


fas (D(p') |E(x) yob(x)| B(p)) = 8P — p) V4mpmp 
= 6(p — P^) [f+ (im) (mp + mg) + f_(tm) (mp — mp)), (3.18) 


where tm = (mg — mp)’ is the value of t = ( p- py at the point of zero recoil, 
and the general decomposition of a vector-current matrix element, 


(DP) |éy,b| BM) = f+ (p+ p’), +O (p-P'),, (3.19) 


has been used in the second line of Eq. (3.18). We have seen results similar to 
Eq. (3.18) in the discussion of the Shifman—Voloshin limit in the previous section. 
However, there the restriction mg — mp << mg + mp was required, whereas here 
no restriction is implied as long as both quarks are sufficiently heavy. 

This framework may be extended to nonstatic transitions [IsW 90] with the 
observation that the heavy-quark symmetry can be applied in any frame moving 
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at fixed velocity. First, in addition to Eq. (3.19) for the B — D transition, we 
require also the D —> D and B — B vector form factors, 


(D(Pp) |ēvuc| D(Pv)) = folto) (po + Pp), + 


(BP) |by.b| BPs) = fa (ts) (Pe + Pp), > (3.20) 
where fg(0) = fp(0) = 1. Considering the momentum transfers tp, tg, and 
tsp = (Pe — Pp) in terms of the velocities, using p} = mjv", pp = mpy, 


and pp = mpv, we have 
te = (pp — ph) =2m>(1—v-v), 
tp = (pp — pp) = 2m?, (1-v-v’), 
tap = (Pr — Po) = (mg — mp)? + 2mgmp (1—v-v'). (3.21) 


If each transition has common velocity factors, the various momentum transfers 
are related by 

m? mp 

tp = —5 ta = — (tap — tm). (3.22) 

mgp MB 
In view of the normalization convention of Eq. (3.2), one must divide the state 
vector of particle i by /2m; (assuming m; > |p|) before applying the b < c 
symmetry. Upon doing so and requiring the resulting expressions to be identical 
functions of the velocities v and v’ leads to the relations 


(DP) levicl D(Pv)) _ (BPs) |byib| BPs) _ (DPD) lévib| B(ps)) 


2mp 2mp J/4m pm zg , 


(v+v). (v+v/). (mgv+mpv), 
tp) —— = tgr) —— = fop = O 
fo(tp) 5 fa (te) 5 fi. (tep) NET 
(mgY — mpy’), 
+ f- (tep) =. 3.23 
f-(tep) eae (3.23) 
After simple algebra, this results in the form-factor relations 
mp 
fet) = fp — tÍ s 
mg 
mg +mp mp 
t) = t — tm ’ 
mB—Mp Mp 
_(t)=— t — tm) |. 3.24 
f-(@) ian Fx | (3.24) 


Although consistent with Eq. (3.21), this manages to separate out f+. The results 
are expressible in terms of a single function of velocity. It is notationally simpler to 
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express the kinematic dependence using v - v’ instead of t, i.e., fit) > fi(v-v’). 
Thus, we have 


4 
fs- v) = fow = a fwv’), 
mg +mp 


at aD e jja eu, (3.25) 
mg — mp 
where, aside from the constraint €(1) = 1, the function &(v - v’) is unknown and 
must thus be determined phenomenologically. If we exploit the full SU (4)-flavor— 
spin symmetry, then all the weak-current form factors involving B, B*, D, and D* 
can be expressed in terms of the quantity &(v - v’), e.g., 


(D* (Pp) |Ev.b| B(pa)) = ivmpmgE (v - v')Epvap€* (Pp) Ui, Up, 
(D* Pp) |Evuysb| B(ps)) = /mpempé(v-v') [0 + v ves —e* v v] 
(3.26) 


The symmetry language is appropriate here because, similar to the symme- 
try relations detailed in the first part of this book, we have related different pro- 
cesses even though there remains an uncalculated ingredient to be determined from 
experiment. However, effective field theory techniques allow a more detailed study 
of the same matrix elements beyond just the leading symmetry relation. Hard per- 
turbative effects can also be included [Wi 91, CzM 97]. Suppressed corrections due 
to deviations from the heavy-quark limit can be calculated in the effective theory. 
The shape of the form factors [CaLN 98] can be determined experimentally, but 
what is most important phenomenologically is the normalization of these form fac- 
tors at the zero-recoil point v - v’ = 1. This deviation is second order in the inverse 
masses [Lu 90] which, since me < mp, means that it is of order 1/ m?. While 
analytic estimates of this deviation can be achieved [ShUV 95, GaMU 12], lattice 
methods now can provide well-controlled calculations of this effect [Be et al. 09]. 

For the b — u semileptonic transition, there is no corresponding heavy-quark 
theory that provides a solid starting point for analysis of the B — rev decay. 
Quark model calculations are particularly unreliable for this transition. Fortunately, 
improved lattice calculations now appear capable of calculating the transition 
matrix element in the region of small recoil [DaGWDLS 06, Ba et al. 09]. Sup- 
plemented by theoretical constraints [BeH 06], experimental work can measure the 
q? variation and use the lattice matrix element to provide the normalization when 
using this process to measure Vav. 

Phenomenologically, exclusive decays are key ingredients to the extraction of 
the CKM elements. The present best values from exclusive decays are [RPP 12]: 
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Væ = (39.6 + 0.9) x 107°, Væ = (3.23 £0.31) x 107°. (3.27) 


The reader will note there is a modest disagreement between the values of these 
elements between the inclusive determination of Eq. (2.17) and the exclusive val- 
ues of Eq. (3.27). For Va», the effect is sizeable and may be indicative of a gap in 
our theoretical methods. The smaller disagreement seen in Væ may also be an indi- 
cation that more theoretical work is needed at understanding the duality between 
inclusive and exclusive methods. 


XIV-4 B°-B° and D°-D’ mixing 


Just as K°—K° mixing occurs due to the weak interactions, so does mixing exist 
in the Ba— B4, B;—B, and D°—D° systems. We shall discuss first the Ba— B4 and 
B,—B, mixings, then conclude with the D? case. The formalism is the same in 
all situations and can be taken directly from the discussion of K°—K° mixing in 
Sect. IX-1. 


B°-B° mixing 
The mixing occurring in Bg and B, mesons is short-distance dominated. This is 
because (i) the dominant weak coupling of the b quark is to the f quark, and (ii) the 
short-distance box diagram (Fig. XIV—2) grows roughly with the squared-mass of 
the intermediate-state quarks. Since the very heavy mass of the top quark greatly 
enhances its contribution, the top intermediate state dominates B-meson mixing. 
The effective hamiltonians for Bz, and B, mixing are? 


2 


= G 2 
Hy? = 2 (Vo Vä) m H&g O” +hic., 


_ G? 
Hy? = E (Vo Vi) mH ONO + hee. 
OF = dy, (1 + ys)bdy"(1 + y5)b, 
OF = 5y, (1 + ys)b 5y" (1 + ys)b, A 


where ng ~ 0.9 is the QCD correction and H (x+) is given in Eq. (IX-1.20). The 
matrix elements of O74 and O* can be parameterized analogously to that used in 
kaon mixing, 

16 16 


(Ba |O™| Ba) = F Fimu Beo (Bs|0™ | B) = Z F5,m5, Beo (4.2) 


5 A more advanced treatment of Bs—Bs mixing than given here appears in [LeN 07]. 
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b w d (s) 


Fig. XIV-2 Box diagram contribution to B-meson mixing. 


where the pseudoscalar decay constants are normalized as 
(0 |dy"ysb| Ba(p)) =iv2 Fep", (0 15y“ysbl Bs(p)) = iV2 Fa, p”. (4.3) 


These correspond to the normalization Fy ~ 92 MeV. 
Both B4 and B, mixing have been observed, with the results, 


Amp, Amp, 
xa = —— = 0.775 + 0.006, Xs = —— = 26.82 + 0.23. (4.4) 
Vz, Tz, 
The width difference of By is consistent with zero, while that of B, is nonzero but 
small, 


AT g AT, 
; = 0.015 + 0.018, 


d s 
In Eqs. (4.4)-(4.5) above, we have denoted Am = my — mz and AT =Ty-T_, 
where H (L) refers to the heavier (lighter) of the neutral B CP eigenstates, 
The large magnitude of x, /xq is readily understood in the Standard Model to be 
mainly due to the CKM elements, as the ratio is predicted to be 


Ames, _ Fi Bz, 

Amp, | F5, Ba, 
The SU(3) breaking in the matrix elements is well under control in lattice calcula- 
tions [LaLV 10], 


= 0.123 + 0.017. (4.5) 


2 


Vis , (4.6) 


Via 


F;../B 
SBN E 1057 0058. (4.7) 
Fond Bi 


The remaining dependence in the ratio of the mass splittings comes from the CKM 
elements and in fact this ratio is the most precise measurement of the relative sizes 
of these CKM elements 


Vis 


td 


= 4.739 + 0.126, (4.8) 


consistent with other determinations. This is an important test of the Standard 


6 We use the updated version of [Am et al. (Heavy Flavor Averaging Group collab.) 12] found in 
www.slac.stanford.edu/xorg/hfag. 
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Fig. XIV-3 Short-distance (a) and long-distance (b) contributions to D-meson 
mixing. 


Model as New Physics could readily contribute to Amg, and/or Amz,. The abso- 
lute magnitudes of these mixings are also compatible with the Standard Model. 
Using the mixing formula developed in Chap. IX and the lattice magnitude 
[LaLV 10] Fez ay Bag = (149 + 9) MeV, the experimental number for Amz is 
reproduced with |V,q| = (8.4 + 0.6) x 1073, which becomes a tight constraint on 
fits of the unitarity triangle, to be discussed shortly. 

The width differences are smaller than the mass differences because real on- 
shell intermediate states are required; thus, top-quark intermediate states do not 
contribute to AT4,s. For this reason, the widths AT4,s are suppressed compared 
to Amas by a factor of roughly m? / m?. The width difference for By is smaller 
than that for B, because the CKM favored decay mechanism b — ccs when active 
for a bd meson leads to an intermediate state (césd) that cannot convert to a db 
meson, while when occurring in the decay of bs leads to intermediate states (ccss 
or cé) that can transition back to sb. Thus, AT4 is CKM-suppressed compared to 
AT. The measurements of AT 4 s are also compatible with theoretical expectations 
[LeN 07]. 


DD’ mixing 

The analysis of D°—D° transitions is considerably more complex than that involv- 
ing B4 s mesons because the mixing is not short-distance-dominated [Wo 85, 
DoGH 86a]. To see this, we display the corresponding box diagram in 
Fig. XIV—3(a), and some possible long-distance contributions in Fig. XIV—3(b). 
The GIM cancelation in the intermediate state is between the two light quarks d, s 
(the b-quark contribution is suppressed by CKM angles). However, there is no 
compensating large mass factor here; long-distance and short-distance effects con- 
tribute at the same order of magnitude. As a result, reliable quantitative predictions 
of Amp have eluded theorists thus far, despite the attempts of many to solve the 
problem. Even such basic issues as correctly predicting the sign of Amp or deter- 
mining to what extent a component from New Physics could be present [GoHPP 
07] remain unresolved. 
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For example, consider the application of the OPE (which has worked so well for 
B4. s mixings) to D° mixing [Ge 92, OhRS 93, BiU 01, BoLRR 10], 


(D°|Hjacj=2lD°) = G ¥°Ci(u)(D°|Q;|D°), (4.9) 


where the prefactor G has the unit of inverse squared mass, the sum is over operator 
dimension, and both Standard Model and New Physics operators are included. The 
expansion begins at dimension six, with two operators for just the Standard Model 
and eight upon including New Physics. However, even within just the Standard 
Model, the number of operators increases sharply as the dimension grows, e.g., 
there are about a dozen at dimension nine and more than twenty at dimension 
twelve. This introduces a multitude of unknown parameters. It is also the case 
that the sum in Eq. (4.9) is not expected to converge rapidly because the ratio 
Agcp/m, ~ 0.25 is not sufficiently small. 

Some aspects of D°—D° mixing can, however, be understood. For example, 
the Standard Model clearly requires that Amp/I'p <« 1 because Amp is twice 
Cabibbo-suppressed (i.e. Amp = O(A*)) while Ip suffers no such suppression. 
Hence, upon counting CKM factors and noting that the GIM cancelation is a mea- 
sure of the breaking of SU (3) symmetry, one is led to estimate that’ 


Amp 


8 7 x [SU (3) breaking] = O(107”). (4.10) 
D 

Of the various meson-mixing systems, the D°—D?° transitions were the last to 
be detected experimentally. However, by studying the decay time dependence of 
D? — Kt+x~/D° + K~nxt, arecent experiment [Aa et al. (LHCb collab.) 13a] 
excludes the no-mixing hypothesis with a probability of over nine standard devia- 
tions. The current-mixing values in [RPP 12] are 

_ Amp 


Ar 
xp = > = (0.632539) x 107, 2 
D D 


= (1.50 + 0.24) x 107°. (4.11) 


The suppression in D°— D° mixing is evident upon comparing the above value for 
Xp with those for xg and x, in Eq. (4.4). 

Observation of D°—D° mixing motivates the search for CP violation in the 
D-meson system. Here, we cite two recent results. In one, the CP-violating asym- 
metry Ap in the time dependent transition D? + K+z~ is measured to be Ap = 
(—0.7 + 1.9) %, which is consistent with zero [Aa et al. (LHCb collab.) 13c]. In 
the other, the CP-violating asymmetry Ar(f), between the D? and D decay rates 
to a given final state f, yields results also consistent with zero [Aa et al. (LHCb 
collab.) 13d], 


7 Actually, it can be proved that if SU(3) violation in D? mixing enters perturbatively, then a group theoretic 
analysis of (0| DHwHw D|0) shows that SU(3) breaking occurs only at second order [FaGLP 02]. 


420 Weak interactions of heavy quarks 


Ar(ata~) = (0.33 + 1.06 + 0.14) x 1073, 
Ar(K* K~) = (—0.35 + 0.62 + 0.12) x 107°. (4.12) 


The uncertainties in the above determinations are dominated by statistical, rather 
than by systematic, effects. Thus, although the current status of CP violation in 
charm is inconclusive, there is reason to be optimistic that additional statistics as 
obtained in forthcoming studies will yield nonzero results. 


XIV-5 The unitarity triangle 


The B-meson transitions form a nontrivial system and provide much of our 
information on the pattern of weak mixing. The overall B lifetime and b —> c 
semileptonic decays are governed by Ve, the suppressed b —> u modes by Vab, 
Ba — B4 mixing by Vig, and B, — B, mixing by Vs. Together with the Vas element, 
these form all of the ‘interesting’ sectors of weak mixing. 

There is a useful pictorial representation of the constraints of unitarity on these 
elements. Consider the effect of the unitarity relation 


Va Vč + Va VŠ + Ve Vä = 0. (5.1) 


u C 


Of the components to this equation, Vaa, Via and Veq are known up to corrections 
of second order in A = |V,s|, yielding 


V — À Va + Vå =O. (5.2) 


If we treat these elements as complex vectors, this relation is equivalent to a triangle 
in the complex plane. In the Wolfenstein parameterization the various elements are 


Va = —Vs = Aà?, Van = A7A(p —in), Va=1°A(1— p—in). (5.3) 


The unitarity triangle is shown in Fig. XIV—4. Note that the unitarity triangle can 
be constructed knowing only the magnitude of the elements | Vel, | Va»l, and | Vial. 
The existence of such a closed triangle is independent of the parameterization. 
Other unitarity triangles, corresponding to the other unitarity constraints, also exist 
but are either less useful than this one or are equivalent to it [Ja 89]. 

The unitarity triangle has an important connection with CP violation. If the 
CP-violating parameter 7 vanishes, the triangle is reduced down to a line since all 
the angles go to either 0° or 180°. In fact, the area Af A’ of this triangle is exactly 
the unique rephasing invariant measure of CP violation. The angles a, 6, y are 
themselves indicators of nonconservation of CP and play a role in the B studies 
to be described in the next section.* Note that the magnitudes of the sides of the 


8 In the literature there is an alternate naming of angles yj = £, g2 = @, g3 = y. We are following the 
conventions of the Particle Data Group. 
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Fig. XIV—4 The unitarity triangle. 


triangle and the interior angles of the triangle are all independently measurable and 
the fact that the separate measurements are consistent is a powerful test of the Stan- 
dard Model. Our Fig. XIV—4 is drawn using the present fits of the sides and angles, 
and illustrates the relative magnitudes of these elements. 


XIV-6 CP violation in B-meson decays 


The decays of B mesons exhibit a rich variety of CP-violating signals, some of 
which are rather large [BiS 81]. These reactions have provided dramatic confirma- 
tion of the validity of the CKM mixing scheme as the dominant origin of CP vio- 
lation. Recall that the value of € cannot be regarded as a prediction of the Standard 
Model because there is an unknown parameter, the CKM phase 6, which must be 
adjusted to fit experiment. The value of €’/€ is consistent with the Standard Model 
and is an important verification of the existence of direct CP violation, but theo- 
retical uncertainties are presently too large for this to be a precision test. However, 
the Standard Model, with its single CP-odd parameter, makes clear predictions for 
the patterns of CP violations in B decays, and observation has confirmed many of 
these. 

There is an important division in the study of CP violations for B mesons: 
(i) processes which proceed via B°—B° mixing, and (ii) those which do not. We 
shall discuss those involving mixing first, and then return to those not related to 
mixing. 


CP-odd signals induced by mixing 


General formalism: The analysis of time evolution for a B? or B° meson parallels 
that of a neutral kaon. Given the conventions for Am and AT following Eq. (4.5), 
one obtains for states that start out at t = 0 being either B° or B®, 


|B°(t)) = g4 (t)|B°) + He OB") 


|B°t)) = 7s O1B + 94(t)|B°), 
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p.. |[Mpn-ilp 


q \Mp-irh 


pal 
eU) = 5¢ 


Ti t/2pimit [1 Leo ent : (6.1) 


The strategy for observing CP-violating asymmetries is to compare the decay 
B°(t) —> f, where f is some given final state, to that of B°(t) —> f, where f 


is the CP-conjugate of f, 
If) 
Let us define the matrix elements 
A(f) = (fHwlB°), 
A(f) = (f HwlB°), 
and their ratios,’ 


ACP) 


Af) = AG)’ 


= CP|f). (6.2) 


A(f) = (flHwlB°), 


ACP) = (flHwlB°), (6.3) 
ACP) 

6.4 

p(f) = ea (6.4) 


The decay rates for the two processes are easily found to be [BiKUS 89] 


1 1 . = 
TB X [a + be~4™ 4 ce™?^" cos Am t + de™?^" sin Am r| e TH, 


a = JAPP (; RE 
2 p 


i +Re [Zan}) , 


2 
= aur (5 hili an] |-Re[£ou]), 
Dp P 
2 


c= JAP)? ( - f p 
P 


d =2]A(f)? Im [ta], 


and 


(6.5a) 


aOR- N oal 5-1 = 
Tio X Ja+be AT! 4 ge^" cos Amt + de 24" sin Am tle Put 


a= JAD) (; hle b| | +Re[Zach]}. 


9 We caution the reader not to confuse the notation for these ratios with the CKM element p in the Wolfenstein 


parameterization of Eq. (II-4.19). 
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_ ss 1 =, 7 _ 
b= JAD (; Í + [Zo l —Re [2a] ! 


2 
é= anthi = 2 oe): 


d= 2/4 Al im| 2a]. (6.5b) 


Any observed difference between these two quantities would indicate the presence 
of CP violation. 

Before considering some examples, there is a simplifying approximation which 
it is useful to make. As seen in the previous section Mı2 >> T2 for B and B,, so 
it is a good approximation to neglect T12 (and hence AT) in almost all cases.!° In 
this approximation q /p becomes a pure phase, q/p = e’®, so that |g/p| = 1. 


Decays to CP eigenstates 


The most striking processes are those where the final state f is a CP eigenstate, 
If) = +| f), suchas f = Y Ks, Y Kr, D? D7, 1*~. In this case one has p(f) = 
1/pe(f). Time-dependent CP asymmetries have two components 


r(B%(t) > f)-TBO = f) 


A = = = in(A = A 
p(t) FB) > Nr f) Sp sin(Amt) — Cy cos(Amt), 
(6.6) 
where 
2 
mjo] t= [fan be 
1+ leao] 1+ lah 


We see that there are two possible ways that the asymmetry can be nonvanishing, 
corresponding to the S; and C amplitudes. 

The cleanest analysis occurs when |o(f)| = 1, i.e., |A(f)| = |A(f)|. An exam- 
ple is Bg —> WK°®, which proceeds dominantly through b — cés, so that both 
factors are pure phases 


Vos Veb q _ VaV 


Pf) = , = . 
Veg Vä P VaVe 


(6.8) 


10 The one exception is the semileptonic asymmetry to be discussed below. 
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Fig. XIV-5 Tree (a) and penguin (b) diagrams for B > rz. 


In this case, it is clear that Cyg = 0. The asymmetry involves the relative phases 
of VX Væ and V.iVin, which we see from Fig. XTV— is the angle £, such that the 
result becomes 


Swk = sin 2B. (6.9) 


This prediction is independent of hadronic uncertainties and depends only on the 
phases in the CKM matrix. The result is large, with the resulting measurement 
[RPP 12] of the angle 6 of sin26 = 0.679 + 0.020, consistent with other con- 
straints on the unitarity triangle. CP violation in this mode is one of the cleanest 
and most direct confirmations of the Standard Model. 

One might at first expect that |O(f)|? = 1 is automatic if f is a CP eigenstate. 
However, it is possible to obtain |o(f)| Æ 1 if there are two different ways to reach 
the same final state. For example, one could have the decay B? —> a+z~ either 
directly through b —> uud or through the penguin diagram, which includes the 
CKM elements for c or t intermediate states, cf. Fig. XIV-5.!! By CKM unitarity, 
we have ViVea = —(Vň Va + VěVa). Therefore, if we absorb the portion of 
the penguin diagram proportional to Vž Vaa into the tree-amplitude reduced matrix 
element, which carries the same CKM factor, we have the amplitude expressed in 
terms of two CKM elements, 


Arta) = VigV IT le? + Vä Vo Ple, 
ATAT) = Va Vi IT ei + Via Vig Ple, (6.10) 


where T and P are tree and penguin amplitudes and ôr, ôp are strong-interaction 
phase shifts. Because the weak phases change sign under CP and the strong phases 
do not, we have the ratio of amplitudes |o(f)| Æ 1. Indeed, experimentally one 
finds 


Syta- = —0.65 + 0.07, Cytr- = —0.38 + 0.06, (6.11) 


11 Tn discussions such as this, it is understood that the weak hamiltonian receives QCD radiative corrections, 
which can mix operators with identical quantum numbers. However, since we are using only the CKM 
factors and symmetry properties of the amplitudes, these corrections do not influence the analysis and are 
absorbed into the reduced matrix elements. 
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Table XIV—1. Standard Model pattern for CP 
violation in B decays. 


Transitions Examples Im (q/p)p(f)* 
b —> ccs Ba > YKs sin 2B 
B; > Wo ~0 
b — ccd By > DD sin 26 
By > WKs ~0 
b > uid Ba > ata sin 2a 
B; > 2°Ks sin 2@ 
b —> uus Ba > 1°Ks sin 2@ 
B; > n?o? sin 2y 


“The angles a, 6, y are defined by the unitarity triangle of 
Fig. XIV-4 and we take |p(f)| = 1. 


indicating the presence of both CP-violating phases and sizeable strong 
rescattering phases. The solution to this ‘penguin pollution’ involves looking at 
other zz modes. There is an isospin relation among the three-pion channels 
(cf. Eq. (VHI-4. 1) 


A(T) — Arn) = V2A(r tn), (6.12) 


similar to the kaon decay analysis of Chap. VIII. The penguin amplitude is purely 
AI = 1/2 and hence only the tree amplitude can contribute to the 7 = 2 final state 
a*+2°. Measurement of branching ratios and CP asymmetries S77, Crn allows 
one to disentangle the CP violation due to tree and penguin amplitudes [GrL 90]. 
For the tree amplitude, involving V Vaa, the interference is with the By-mixing 
amplitude, dominated by the top quark, so that the measurement is of the CKM 
phase a. 

At this stage we can categorize the decays of neutral B mesons to CP eigenstates. 
For this purpose it is most convenient to use the Wolfenstein form of the CKM 
matrix. In this parameterization, the elements Vip, Veb, Vis, Ves are all almost purely 
real. The B4 and B, decays can proceed either through the CKM-favored transition 
b — ccs or the CKM-suppressed transitions b > uud, b —> ccd, b — uus. In the 
former category are included By > wKs and also B, > wy, wn, D} D>. The B, 
decays pick up no phase since 


q _ VsVit 


= Veb 
= = l] and = 


=e m| ta] =, (6.13) 


However, the B4 decay does pick up a phase, leading to a distinctive signature of 
the Standard Model. The CKM-suppressed decays can also be analyzed in terms 
of the angles which appear in the unitarity triangle, and are given in Table XIV—1 
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for the case |o(f)| = 1. However, in some cases we know that |o(f)| 4 1, such 
that further efforts are required to extract the given angle, as described for the zz 
system above. It should also be pointed out that under all circumstances, asym- 
metries for B, are more difficult to observe because x, is large due to the rapid 
oscillations in the B, <> B, system. Thus, regardless of whether one starts out at 
t = 0 with B, or B,, after a few oscillation lengths one will have roughly equal 
amounts of B, and B,. 


Decays to non-CP eigenstates 


There may also exist CP violation in final states which are not CP eigenstates. 
Consider, for example, the final state Bg —> x~ K*. This transition can occur both 
through tree amplitudes, with the CKM factor VÄ Vas and through penguin decays 
of the form b — sqq. Because the CKM elements satisfy Vi Vis = —(VG, Ves + 
Vä Vas), We can write the amplitude in terms of two reduced matrix elements such 
that the corresponding decays of the B? and B° will have the form 


A(T K*) = Vå Vas IU] et + Vå Va IC] el , 
A(T KT) = Væ Vă |U| e ®t + Va Vä |C e’, (6.14) 


where the reduced matrix element C comes from the penguin diagram alone and 
U comes from a mixture of tree and penguin effects. The decay rates for these two 
processes will then be different by a factor 


|A(x~ K*)? — A(t K7)? = —4|U]|C] sin(Sy — 8c)àfA?n, (6.15) 


where we have used Im Và Vas Vo Vă = à A?n in the Wolfenstein parameteriza- 
tion. This effect has required two paths to the given final state, with differing strong 
phases and differing weak phases. Because the hadronic matrix elements are diffi- 
cult to calculate reliably, this rate difference cannot by itself be a precision test of 
the Standard Model. 

However, there is a way to make an approximate test of the Standard Model 
using corresponding decays of the B, meson. The key point [He 99, Gr 00] is 
that the tree process b —> uud and the penguin amplitude for b — dqq proceed 
identically to the corresponding processes used above for b —> uus and b > sqq 
aside from CKM factors. In the U-spin subgroup of SU (3) the d and s quarks form 
a doublet, and all other quarks are singlets. The two sets of interactions then form 
two components of a U -spin doublet, and their matrix elements are related. Bg and 
B, are also related by U-spin, so that the matrix elements for By —> m~ K* and 
B, — K~nx* are U-spin reflections of each other. The corresponding rates for B, 
decay are given in the U-spin limit by 


XIV-6 CP violation in B-meson decays 427 


A,(K-m*) = Vå Va |U] e + VEVea IC] e , 
As (KTT) = V Vý [U| e?! + VV IC] ef. (6.16) 


u Cc 


The weak CKM elements are different, but the hadronic matrix elements are 
the same. However, the Standard Model has only a single CP-violating phase, so 
the the imaginary parts of the products of CKM elements are always related. In 
this case, they are identical up to a sign Im VÄ Vaa Vo VG = —A°A?n, such that the 
decay rate differences are the same 


[A(T K*)|? — |A(x* KT)? = (As (Krt)? — |A (Kt). (6.17) 


However, asymmetries are defined by dividing by the the sum of the decay rates, 
and the overall decay rates are different in these two cases. Correcting for the over- 
all rates yields a sum-rule [Li 05] 

Br(Ba > n7 Kt)t, 


=A p(B, > K-xt +A B >n K? = 0, 
Q C ( s ) cP( d Br(B, K ) 4 


where Br is the CP-averaged branching ratio. Despite the individual rates and 
asymmetries being different, the sum-rule appears valid within error bars [Aa et al. 
(LHCb collab.) 13c] 


Acp(B, > K~x*) = 0.27 + 0.04 + 0.01, 
Acp(Ba > z7 K+) = —0.080 + 0.007 + 0.003, 
Q = —0.02 + 0.05 + 0.04. (6.19) 


While the use of U-spin symmetry is only approximately accurate, this sum-rule 
nevertheless is a strong test of the overall pattern of direct CP violation within the 
Standard Model, including loop diagrams. 


Semileptonic asymmetries 


For a final example involving mixing, let us consider CP violation in semileptonic 
decays. In much of our previous analysis, we have neglected the quantity r12. How- 
ever, for semileptonic decays, the whole effect vanishes if we neglect T12, so we 
must include it. For this case, only the transitions B? > "wX, B? —> DX 
(£ = e, u, T) can occur. The ‘wrong sign’ transitions in the time developments, 
B(t) > LDX, B°(t) > l*veX, are then uniquely due to mixing. The appro- 
priate formulas can be obtained from our general result Eqs. (6.5a), (6.5b) by the 
substitutions 


428 Weak interactions of heavy quarks 


Ale) > 0, A(e~)p(e-) > Ale), 
A(et) > 0, A(et)p(et) > Alet) = A(e>). (6.20) 


The integrated rate is 


2 
oo qd p 
o dt [T zox -= T 50¢)-+2+ypx | 2 |s] al (6.21) 


= Fo - 2 2 
in dt [P30 (4) + 0-5, x + T posety] |s] $ 
p 


AsL 


Pp 
q 


This sort of CP violation is thus solely sensitive to mixing in the mass matrix, as 
was the semileptonic K? asymmetry. Unfortunately, in the Standard Model it is 
small for reasons connected to the CKM elements. Expanding in powers of T12 
and defining gr = arg (Tir/ Miz), one has 


sin gp. (6.22) 


We have seen that AT /Am is suppressed by factors of m? /m? since the top quark 
cannot contribute to the real intermediate states required for AT. For B,, there is 
a further suppression in the Standard Model because the dominant contributions 
to T12 (cc intermediate states coming with CKM elements (V Ves)*) and Mp (tt 
intermediate states with (Vý V,;)*) have almost the same phase because Vý Ves = 
— Vý Vest + O(A7)]. Thus, gr, is also suppressed to a fraction of a percent. These 
features are seen in the theoretical predictions [LeN 11] 


AS" [Thy] = (4.1 + 0.6) x 1074, AS-[Thy] = (1.9+0.3) x 107°. (6.23) 
The present experimental results [RPP 12, Ve (LHCb collab.) 13], 


AS"[Expt] = 0.0007 + 0.0027, AS"[Expt] = —0.0024 + 0.0054 + 0.0033, 
(6.24) 


are not yet precise enough to confirm the Standard Model predictions. 


CP-odd signals not induced by mixing 


Situations where CP violation occurs without the presence of mixing can occur in 
B= decays through the interference of different decay mechanisms. The require- 
ments are the same as we saw previously in a different context, i.e., there must 
be two different paths to the same final state, these paths must have different 
strong-interaction final-state phases, and the two paths must also have different 
weak phases. Consider, for example, the decays B+ —> D°K+ and Bt + DKY. 
While initially one might think that these two reactions are distinct, if the D° and 
D? decay to a common final state, such as K sz, the overall amplitudes to that 
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final state will in fact interfere. The decay with a D° in the final state involves the 
b — ics reaction, with CKM elements Vi, Ves, While the D’? reaction proceeds 
through b —> cus and Vý Vas- The relative phase between these amplitudes is the 
angle y. 

Interestingly, despite the need for final-state phases in this reaction, the CP vio- 
lation can be extracted without hadronic uncertainties [GrW 91, GiGSZ 03]. The 
key to this is that the subreaction D? > K Orta- can be separately measured in 
tagged D reactions as a function of the kinematic variables, and then can be treated 
as a known quantity. In addition the D? and D? decay amplitudes are related to 
each other!? at mirror kinematic values. In particular, if the decay D° > KÌn+tr- 
is given the name A(m7 , m2) with mł = (px + ps)? then the corresponding D° 
amplitude is A(m;,, m2) = A(m? , m2). The amplitudes, including the possibility 


of final-state interaction phases, have the form 


2 2 2 2 Right Bx Beye 
[Apt (Ksata-yK+|” = (Ao An, mZ) + rA(m{,, m= eT” |’, 


|A B- (Ksrta oK- | = lAo lA (mi, m) +rA(m?, me?’ |, (6.25) 


where an overall amplitude for Ag = Ag+ pox+ has been factored out and where 
r is the ratio of the magnitudes of the amplitudes r = |Ag+_, pog+|/|A g+> pog+l: 
Here, the possible strong-phase difference 5 has been made explicit. Knowledge 
of the D decay amplitudes plus the observation of both B® decays then lets one 
separate the strong phase from the weak phase and also divide out the underlying 
weak matrix elements. This has become a favored way to measure the angle y with 
the present result [Aa et al. (LHCb collab.) 12], 


Yeats) (6.26) 


when all related channels are included. 

To summarize, we have discussed thus far a variety of tests for CP-violating 
signals in the system of B mesons. The partial rate differences can be quite large. 
At first, this seems to go against the general dictum that all CP violations in the 
Standard Model must be proportional to a single, numerically small product of 
CKM angles. However, B decays satisfy this stricture in the sense that the mix- 
ing and decay of B mesons are in themselves proportional to small CKM angles. 
Overall, the product of mixing, decay, and CP violation does turn out to be propor- 
tional to all of these CKM angles. However, in forming the asymmetry by dividing 
out the rates themselves, one is canceling the small CKM angles, thus leaving a 
rather large effect. This argument also explains why there is little CP violation in 
D decays in the Standard Model. The CP observables must be small due to the 
usual product of CKM angles. However, the overall decay rate itself has no small 


12 Here we neglect CP violation in the D-meson system, which is a good approximation for CKM-favored 
decays. 
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(a) (b) 


Fig. XIV-6 Some one-loop diagrams for rare B decays. 


angles, so that the signal remains small. B-meson decays have proven to be optimal 
for the exploration of the rich CP-violating structure of the Standard Model. 


XIV-7 Rare decays of B mesons 


The number of B-decay modes is so large that any single mode will be ‘rare’ in 
the sense of having a small branching ratio. Nonetheless, considerable attention 
has been given to modes that proceed only at one loop, as in Fig. XIV—6, and these 
are the ones that are normally labeled as rare decays. The expectation is that, by 
measuring the transition rates of such processes, one can test the Standard Model 
at loop level, and hopefully observe deviations due to New Physics. Moreover, 
since prediction of rare decays involves many of the techniques we have developed 
for calculating weak transitions, these decays can provide a nontrivial test of our 
ability to apply the Standard Model. 


The quark transition b > sy 
The process b — sy is described by the magnetic-dipole transition 
on 
afd 87? 
x U(Ps)Ouv [mp (1 — ys) + ms (1 + ys)] u (Pb), (7.1) 


Mb>sy = Fo Va VE" (q)“q” 


where the quark mass factors occur in the combination shown because the o,,, 
Dirac matrix connects left-handed fields to right-handed fields, and a factor of mass 
must appear whenever a chirality change L — R occurs. 

The quantity F2, which represents the quark-level loop amplitude with numerical 
factors containing G p and e extracted, is given by 


Fy x Fo (x1) P (xe) & Fa (x), (1.2) 


with x; = m? /M}, and 


= x 2%% 5x 7 3x: 
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Y 
(a) (b) 


Fig. XIV-7 Standard Model diagrams for b > sy. 


The flavor content of F, and the overall factor of Væ Vă in Eq. (7.1) can be eas- 
ily understood. The overall loop amplitude, which involves a sum over the inter- 
mediate quark flavors t, c, u, must vanish in the limit of equal quark mass from 
a GIM cancelation since it involves a neutral flavor-changing process. In reality, 
however, the contribution from the very light u quark is negligible, and the top- 
quark contribution to F clearly dominates. The CKM unitarity relation Vẹ V = 
— V Vă — Vu Vý can be used to substitute for Vp V upon neglecting the small 
factor Va V. The b + sy decay rate, relative to the b —> cev, semileptonic rate 
can be expressed in the simple form 
Tp+sy — 3a] Fy? , (7.4) 
losie f(m_/mp) 

where f(x) is the phase-space factor given in Eq. (2.1), and factors of m?/m? 
arising from phase space and from the amplitude of Eq. (7.1) have been dropped. 

Short-distance QCD corrections can be used to improve this free-quark calcula- 
tion. These produce a surprisingly large modification to the analysis of b > sy, 
and the reason is instructive. The ¢ quark is so heavy that, at all scales relevant to 
the weak decay, its effect may be treated as a point bsy vertex, with renormaliza- 
tions as in Fig. XIV—7(a). However, the c quark is light on all scales from Mw to mp 
so that in its renormalization one must also include the diagrams of Fig. XIV—7(b), 
where the dot represents the b — ccs weak hamiltonian. That is, there is mixing 
between the b > sy vertex and the b — ccs transition. The theoretical prediction 
is [Mi et al. 07], 


By-+sy[Thy] = (3.15 + 0.23) x 1074, (7.5) 


for photon energies above 1.6 GeV. The corresponding measurement (highly 
nontrivial) is [Am et al. (Heavy Flavor Averaging Group collab.) 12] 


Bp x,y [Expt] = (3.55 + 0.24 + 0.09) x 107%, (7.6) 


where the last error bar is due to uncertainties in the treatment of the photon energy 
distribution. 

At the hadronic level, the quark transition b —> sy is observed in channels such 
as B > Kry, Krry, etc. The simplest final state occurs when the Kz system 
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S 


Fig. XIV-8 The penguin diagram for B, > 4+4. 


forms a resonant J” = 17 state, K*(890).!? As the inclusive rate appears to be in 
agreement with the Standard Model, this effort is a test of the calculation of exclu- 
sive transitions. Within the same class of decays is the transition B > K*€*é-. 
Theoretical interest in this transition comes from the hope that New Physics not 
present in B —> X,y could show up here [DeHMV 13]. The amplitude includes 
Z° as well as photon exchange, and the loops could be sensitive to new inter- 
actions. Experimentally, the decay is rich and challenging because a full angular 
distribution can be probed, with the possibility of sensitivity to different physics in 
different kinematic regions. 


The decay B; > ¢* £7 


The leptonic transition B, —> ¢* 7 is also particularly promising as a sensitive test 
of the Standard Model. The rate is suppressed even more by a factor of m? due to 
a helicity argument which relies on the current—current structure of the theory, and 
this allows New Physics to be present. 

The decay proceeds through the Z° penguin diagram of Fig. XIV-8 with the 
dominant contribution from the top quark due to its large mass. The photon penguin 
does not contribute because the photon as a vector has C = —1, while the lepton— 
antilepton pair with zero angular momentum has C = +1. The transition then 
occurs through the axial-vector Z 0 current, with an effective hamiltonian, 


Gra 


H = ne a lypyse, (Fel) 
w 


where, as usual, C4 is a coefficient which includes the QCD short-distance correc- 
tions. When computing the decay amplitude, we encounter the matrix element 


(Olby"yss|Bs(q)) = i Fe." (7.8) 


and the q” contracted with the lepton current produces a factor of mz, in direct anal- 
ogy to the pion decay discussed in Chap. VII. Note that scalar or pseudoscalar inter- 
actions would not have such suppression and so these New Physics possibilities 


13 The B > K y transition is forbidden because it is a spin-zero to spin-zero transition. 
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could potentially have a large enhancement over the Standard Model prediction. 
The theoretical prediction [BuGGI 12, DeFKKMPT 12] 


poe) = (3.54 + 0.30) x 107? (7.9) 


Bs> pt pe 


is quite robust, with the major uncertainty being the lattice calculation of Fg,. This 
mode has recently been measured [Aa et al. (LHCb collab.) 13b] with the result, 


Bord = Gz) x 107°. (7.10) 


Bs et 
An even more recent result, although preliminary, shows that combined LHCb and 


CMS data agree with the Standard Model prediction by more than 5c. This clearly 
indicates that there is no large effect from New Physics. 


Problems 


(1) Patterns of CP violation 
All signals of CP violation involve the interference of two or more amplitudes. 
Identify the origin of the interference in partial rate asymmetries for the decays 
(a) Bs > pọ, (b) Bs > p*nF, (c) By > Ky, (d) B* > p*r’, (e) 
Bt —> K+n°, 

(2) Amplitude relations in the heavy-quark limit 
In the heavy-quark limit, a static b quark in a B meson can be described in 
terms of just the two upper components of its four-component Dirac field. This 
can simplify various matrix elements or be used to relate them. Use this feature 
to show that the B —> K*y matrix element of the o”” operator, 


(K* (e€, k)|5o0""b|B(p)) = e% [A el pg +B efke tet - pC poke], 
can be related to the vector and axial-vector form factors of B > pli, 
(p* (€, K)ity"b| B°(p)) = iD e” pyedkp, 
(o* (€, k)|uy"ysb|B°(p)) = E e" + ét - p [Fp" + Gk"), 
through 
A = —(E — komgD)/mg, B=-—-mgD, C=(D + G)/mpg, 


under the assumptions of a static b quark and of SU (3) symmetry. In this 
relation, all form factors must be evaluated at the same momentum transfer, 


qg = (p—ky. 


XV 
The Higgs boson 


On July 4, 2012, the LHC collaborations ATLAS and CMS announced the discov- 
ery of a resonance which, despite limited statistics, seemed to have characteristics 
expected of a Standard Model Higgs boson. Mass determinations presented at the 
2013 Lepton-Photon Conference are 


125.5 + 0.2 (stat) *)2(syst) [Ja (ATLAS collab.) 13] 


1.1 
125.7 + 0.4 [De (CMS collab.) 13]. pn 


My (GeV) = 


Since this resonance has a nonzero branching fraction for decay into two photons, 
it must be a boson, one not having spin-one. In fact, current spin/parity analyses 
are compatible with J? = 0* but not with J? = 07, 1+, 17, 2+ [Aa et al. (ATLAS 
collab.) 13b], [Ch et al. (CMS Collab.) 13]. Its couplings to bosons and fermions 
appear to be consistent with Standard Model expectations, in particular that the 
Higgs should couple to mass. At present, the overall precision is limited to about 
25%, so an extended period of careful study will be necessary to reveal the anoma- 
lous properties, if any, of this particle. In this chapter, we will consider the basics 
of the Standard Model Higgs, with the intent of describing its phenomenology and 
also addressing certain theoretical issues. 


XV-1 Introduction 


A central feature of the Standard Model is the spontaneous symmetry breaking in 
the electroweak sector which gives mass to fermions and to the W* and Z° gauge 
bosons. Although a complex doublet of Higgs fields is initially introduced in the 
Weinberg—Salam model, there remains following spontaneous symmetry breaking 
precisely one physical Higgs state, a neutral scalar particle H°. That is, if we define 
the number of degrees of freedom for Higgs and gauge-boson states, respectively, 
as Ny and Ng, then before the symmetry breaking we have Ny = 4, Ng = 8 
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whereas afterwards we find Ny = 1, Ng = 11. To obtain these values, recall 
that massive vector particles have three spin components whereas massless vector 
particles have just two. Although the total of Higgs and gauge-boson degrees of 
freedom remains fixed (Ny + Ng = 12), there is a transfer of three states from the 
Higgs sector to the gauge-boson sector. These Higgs states become the longitudinal 
spin modes of the W*, Z? particles. 

This transfer can be displayed analytically by first performing a contact trans- 
formation to cast the two complex Higgs states y°, y* in terms of four real fields 
H? and x = {x;} (i = 1,2, 3) 


— 77-1 0 
=U wfe oa) (1.2) 
where 
U(x) = exp(ix : t/v), (1.3) 


and we recall that v = 1/./2!/?Gy ~ 246 GeV. One completes the procedure with 
the gauge transformation, 


0 
0’ = U(x)o = 
(v + H°)/V2 
Wp =UV, Vr=Vr, BL = Bu 
ZW, = UG 5 WUO + igr aU U0, 4) 


for all fermion weak isodoublets Yz and weak isosinglets wr. Within this unitary 
gauge, the physical content of the theory is manifest, and the quantity ®’ is seen to 
contain a single Higgs field H®.' In the following, we shall employ this gauge but 
with the primes in Eq. (1.4) suppressed. 
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We have already specified in Chap. II how the Higgs boson H fits into the Standard 
Model. The various lagrangians written down there provide the basis for a complete 
phenomenological portrait to be drawn for the H boson. In this section, and the 
ones to follow, we present the theory for this program. 


! For notational simplicity, we shall hereafter omit the superscript ‘0’ and denote the Higgs field simply as H. 
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Higgs mass term 


Consider first the Higgs potential of Eq. (II-3.19) which, when expressed in terms 
of the field H, becomes 


2 
À 
+ u’ Hé + Avis + qo (2.1) 


V= 


4 
where the parameters u, A are a priori unknown. The term quadratic in the Higgs 
field determines the Higgs mass to be 


My = V2 = vv2). (2.2) 


This does not provide a numerical value for the Higgs mass My because only the 
quantity v, but not A, is phenomenologically determined. 

This fact places the burden of determining the Higgs mass on experiment. We 
will interpret the LHC finding of an unstable boson as indeed the Standard Model 
Higgs boson and for definiteness adopt the value 


My = (126.0 + 0.5) GeV (2.3) 
for subsequent discussion. If so, the remaining parameters in Eq. (2.1) become 


w=89.1403GeV and A=0.131+0.001. (2.4) 


The naturalness problem 


Radiative corrections to the Higgs mass raise a question of the ‘naturalness’ of the 
Standard Model. To motivate the discussion, let us first consider one-loop electro- 
magnetic corrections to the electron mass. If we impose a cut-off A, on the momen- 
tum flowing through the loop, the mass shift, 


| 3a. Ae | 
Me = Meo | 1+ -— In +e, (2.5) 
i 20 Meo 
is obtained. The magnitude of this first-order correction, although cut-off depen- 
dent, is generally tiny. Taking for A, the entire mass of the observable universe, 
Ae ~ 10” Gey, results in only the modest mass shift me ~ 1.7me o. This teaches 
us that, with logarithmic behavior, the renormalization program of absorbing diver- 
gences into renormalized parameters is not implausible. 

However, radiative corrections to the Higgs mass are not as tame. We display 
in Fig. XV—1 one-loop self-energy processes which shift the Higgs boson mass. 
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(a) (b) (c) 
Fig. XV—1 Some quadratically divergent Higgs self-energy diagrams. 


Considering for definiteness diagram (c), which involves a Higgs loop with quartic 
self-coupling, we have 

d*k i 
(27)* k? — M? o + ie 


—i Xy (p) = -3i f (2.6) 
This expression is quadratically divergent, Ey ~ A%,, where Ay is the cut-off 
parameter for the above integral, and leads to a shift of the Higgs mass, 


À 
Mi, = Mho + S (2.7) 
If Ay is as large as, say, the Planck mass Eplanck ~ 10!° GeV, then in order to 
obtain a renormalized mass as given by Eq. (2.3), the parameter Mao must be 
negative and have a magnitude which equals the correction up to 31 decimal places! 
This is referred to as fine tuning. While technically possible, it is surely unnatural. 
Including the other contributions of Fig. XV—1 we obtain the Higgs mass shift 


Mi = Mho + a [Mz + 2My + Mz — 4m?]. (2.8) 
It is possible to cancel this mass shift by arranging the value of My contained 
within the brackets in Eq. (2.8). This strategy gives My ~ 314 GeV, which is ruled 
out by experiment. 

The inability to make sense of Higgs mass corrections is perhaps the most seri- 
ous flaw in the fabric of the Standard Model. At present, there are no known com- 
pelling mechanisms for curing this ailment. Accordingly, many physicists have 
been motivated by this ‘unnaturalness problem’ to search for alternatives to the 
Standard Model description, and to suggest that New Physics must exist not very 
far above the weak scale v ~ 250 GeV. 


Higgs coupling constants 


There are a variety of ways that the Higgs can interact, including vacuum energy, 
Higgs self-couplings, Higgs couplings to massive particles, and finally Higgs 
couplings to massless particles. 
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Table XV-1. Higgs-boson coupling constants. 


Sf fH SWWH SwwH2 8ZZH &ZZH? ZH gyi 
mp 2M3, 2M3, 2M3 2M3 3M? 3M?, 
v v v2 v v2 v v2 


Vacuum Higgs energy: The first term in V, the Higgs potential of Eq. (2.1), is a 
constant energy density, which can be interpreted as a contribution A“? to the 
full cosmological constant A. Inserting known values for u and v, we have 


Dad 


Ug | = AT — — ~ 1.2 x 108 GeV4, (2.9a) 


which is huge compared to the observed value [RPP 12], 
[A >S)| ~ (2.3 x 1077 eV)* = 2.8 x 10-*” GeV". (2.9b) 


This should not, however, be viewed as a defect of the Higgs mechanism, as there 
are many such contributions to the vacuum energy. Presumably, there is some over- 
riding issue of physics which forces the suppression or cancelation of the vacuum 
energy by so many orders of magnitude. 


Higgs coupling to massive particles: Next, we express couplings of the Higgs 
boson to particles which have nonzero mass. In cases where n identical fields 
appear, a numerical factor 1 /n! is introduced to account for the number of identical 
fields. The set of all such coupling constants is collected in Table XV-1. 

The Higgs potential of Eq. (2.1) contains cubic and quartic Higgs interactions, 
which we express as 


ER? H? — Sut 
3! 4! 


There are also couplings of the Higgs to massive fermions. From Eq. (2.3) and 
Eq. (II-3.20), we find for the interaction to fermion f, 


Lyin = —8FpnA vey. (2.10b) 


Lyeat = — H*. (2.10a) 


The catalog of Higgs particle interactions is extended by presenting its couplings 
to the W* and Z° bosons, including both trilinear and quadrilinear terms for each, 


— & 2 
Lwwu = W, wt | oe H? + gwwa | ; 
E? u | EZZH? 772 , SZZH 
LZZH = ZZ | (212 H4 + 71 H ; (2.10c) 
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where we have employed Eqs. (I-3.18), (II-3.29), (I—3.32). Observe that each of 
the couplings 844, 87H» 8wwH?2» 8zzxH2 are pure numbers whereas gy3, 8wWwH, 
&zzu have the unit of energy. 


Higgs coupling to massless particles: The coupling between the Higgs boson 
and a particle depends on the particle’s mass. This means that at the basic level of 
the Higgs lagrangian, there is no coupling to photons and gluons because these par- 
ticles are massless. However, such couplings are induced through quantum effects. 
This is a phenomenon we have seen already in Chap. IV, in which the photon- 
photon interaction, yy — yy, although zero at a fundamental level, is described 
to one-loop order by the Euler—Heisenberg effective lagrangian of Eq. (IV—8.5). 

Higgs—photon—photon vertex: A Higgs boson will couple to a two-photon final 
state through W~-boson and charged-fermion loops. The decay rate 


2 


l [Aaw + D Nea Ayatx))| (2.11) 


f=q.e 


Mi, 
re ae 
contains the loop functions A; (x) and Aj/2(x), 


A(x) = - [Dae + 3x + 3(2x — I f(x)] , 


2 
Aix) = 2 [x+ aœ TFs 


arcsin? (./x) (x <1) 
f@= 1 I+ ü- 1/x)!/2 2 (2.12) 
—— | In| ——— | -ir |, («>1) 
4 1 — (1 — 1/x)!/2 
where x is the dimensionless variable x = M;, /(4m?) and the subscripts on 


A, (x) and Aj/2(x) denote the respective spins of the loop particles. The sum over 
fermions f in Eq. (2.11) is taken over both quarks q and leptons £. 

The above procedure is based on calculating the decay amplitude from Feynman 
diagrams as in Fig. XV—2. It is worthwhile to consider the possibility of an alter- 
native approach. Throughout this book, we have emphasized the use of effective 
field theories. Can we employ this method here, via a local effective lagrangian, 
to describe the Higgs—photon—photon vertex? Note that the function f(x) defined 
in Eq. (2.12) develops an imaginary part for m < My/2, which is the case for 
all the loop fermions except the ¢ quark. The imaginary part signals that H would 
be able to physically decay into any of the light fermion—antifermion loop pairs. 
If so, the conversion of a Higgs into two photons is nonlocal and cannot possibly 
be described with a local lagrangian defined at scale u = My. Although the W~ 
and f quark evade such a prohibition, the issue remains whether it would be a good 
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Fig. XV-2 H — yy via (a) charged fermion, (b)—-(c) W boson. 


numerical approximation to use a local lagrangian for either. Let us compare the 
loop functions A; and A;/2 evaluated both in the heavy mass limit x — 0 and also 
using the physical values xw œ 0.60 and x, œ 0.13, 

A, (0) — Ai (xw) = A12(0) — A1j2(a1) 


~ 0.16 


: ~ 0.03. 
A, (0) Aj/2(0) 


Since the difference between the infinite-mass and physical t-quark amplitudes is 
only 3%, most would agree that an effective lagrangian description for the f-quark 
contribution is appropriate, and we write 
Loe = 9 HEY PF ; © — 2a 
eff = Sy uu with g, = Iny’ (2.13) 

where « is the fine-structure constant and F,» is the electromagnetic field strength 
tensor (cf. Eq. (I-5.9)). Note that the heavy top quark evades the decoupling theo- 
rem of Sect. IV-2 because the tH vertex is proportional to the large mass 
parameter m,. 

An alternate derivation of Eq. (2.13) begins by considering the contribution of a 
tt loop to the photon vacuum polarization [ShV VZ 79], 


qpa N2 

O“ (q) = (q"q” — q°g"”) Ès oMr |. (2.14) 
t-quark T m? 

where q; = 2/3 is the top-quark electric charge in units of e and we have chosen 

regularization with cut-off A here (instead of the dimensional approach used else- 

where in this book) to keep the notation compact. The photon vacuum polarization 

of Eq. (2.14) can equivalently be expressed via the effective lagrangian, 


- 1 qa, A? 
coo paw EnA, 2.15) 
ph. vac. pol. 4” T m? 


2 For reference we note the expansions about x = 0: Ai &) ~ 4/30 + 7x/30 + 2x? /21 +--+) and 
Aj (x) © -7 — 22x/15 — 76x2/105 +--+. 
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as can be shown by taking its photon-to-photon matrix element. Now, writing the 
top-quark mass term together with its Higgs interaction, 


i Bye 
Lui =-— (m En Ht) Tasi (1 i =) it, (2.16) 
v v 


suggests treating the Higgs field as a constant and thus formally extending the top- 
quark mass as m, —> m, (1 + H/v). Inserting this into Eq. (2.15) and considering 
only the term linear in H yields precisely the effective lagrangian of Eq. (2.13). 
This ‘background field’ derivation is valid if the momenta involved are small com- 
pared to the top-quark mass, which is not perfect but a good first approximation.? 

Higgs—Z°—photon vertex: This process, too, occurs first as a loop amplitude 
([CaCF 79], who assume My < Mz and study Z? — Hy; see also [BeH 85]) 
via triangle diagrams dominated by W~-boson and t-quark contributions. We refer 
the reader to the literature for the explicit, somewhat cumbersome, analytic form 
of the vertex. 


Higgs—gluon—gluon interaction: The Higgs two-gluon amplitude has similarities 
with the Higgs two-photon interaction. One calculates Feynman amplitudes for 
triangle diagrams, although now summed over only quarks {q} since gluons couple 
neither to leptons nor to the electroweak gauge bosons, leading to 


oE 
q 


The top-quark amplitude is by far the largest in the above sum, and so we can 
again turn to the effective lagrangian description. The contribution of a tt loop to 
the gluon vacuum polarization in cut-off regularization is 


2 


2M? 
Zu , (2.17) 


T 


T H> gg = 


uv HY 2. pV Os A? 
TI” (q)ab = ĝa (q"q" — q8) | mm t (2.18) 
t-quark 6x my 
which leads, as explained earlier, to 
i 1 a A? 
CO Se Pe E (2.19) 
gl. vac. pol. 4 # 67 m? 
and finally, from Eq. (2.16), to 
— pil auv pa : (t) as 
Let = 8 H F Fi, with See =~ one’ (2.20) 


3 For completeness, we take note of yet another derivation [EIGN 76] of Eq. (2.13) which uses the QED trace 
anomaly (see Eq. (III-4.16) for the QCD version), 
as 


gt 
Be 1297 


Fav FH” + mitt, 


taking into account only the t-quark part of the fermion contribution. 


442 The Higgs boson 


where F““” is the chromodynamic field strength tensor of Eq. (II-2.2a). This rep- 
resents the linear term in an expansion in powers of the Higgs field H. Higher 
powers provide the two-gluon coupling to an arbitrary number of Higgs bosons. 
The quadratic term in this expansion would be a prediction for gg — HH. There, 
in addition to the direct coupling of Eq. (2.20), one encounters a pole diagram (i.e. 
gg — H — HH) which contains the triple Higgs coupling. The direct and pole 
contributions cancel exactly at threshold and, more generally, the residual effect 
remains small. 


XV-3 Production and decay of the Higgs boson 


Following the discovery of the top quark, finding the Standard Model Higgs boson 
became a primary goal of experimental particle physics. The search strategy was 
based on Standard Model predictions of both production and decay amplitudes. We 
discuss each of these in turn, beginning with the topic of Higgs decay. 


Decay 


One begins calculation of a Higgs decay mode with the lowest-order amplitude, 
and then incorporates higher-order QCD and electroweak (EW) corrections. These 
higher-order effects are described, with many references, in [Dj 08]. Here, we dis- 
play branching fraction predictions in Table XV—2 [He et al. 13], but restrict our 
presentation here to only the lowest-order analysis (except for two decays H — bb 
and H — gg, which have especially large corrections). The major two-body Stan- 
dard Model decay branching fractions in Table X V—2 correspond to a total width, 


rE” ~ 4.21 (£3.9%) MeV. (3.1) 


The individual branching fractions in Table X V—2 are purely theoretical quantities. 
An experimental reality at LHC is that detection of the modes bb, gg, cĉ is greatly 
inhibited by huge hadronic backgrounds. As a consequence, other modes (e.g. yy) 
can play a central role in Higgs phenomenology at the LHC, despite their smaller 
branching fractions. 


Table XV-2. Two-body Higgs branching fractions.“ 


bb ww 2g TET če ZZ? yy yz pt 
56.1 23.1 8.48 6.15 2.83 2.89 0.23 0.16 0.02 


“All branching fractions are in % and the value My = 126. GeV is assumed. 
>The asterisk denotes a virtual vector boson. 
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Decay into fermion-antifermion pairs: For transitions of the type H —> f f, the 
leading-order (LO) decay rate is 


Nem 
reo) = My (1-4 
T V 


I, (3.2a) 


where my is the fermion mass (which arises from the Yukawa coupling), x, = 
my/My and N: = 1 for leptons and N. = 3 for quarks. We already know that in 
the Standard Model the Higgs coupling to a fermion—antifermion pair is linear in 
the fermion mass m s. The factor of mi in Eq. (3.2a) reflects this and ensures that 
the bb mode is largest amongst all fermions with 2m; < My (the mode H —> tt 
is kinematically forbidden). 

Let us consider the H —> bb mode in a bit more detail. If Eq. (3.2a) is used 
to determine the bb decay rate and Eq. (3.1) is used for Po then a branching 
fraction ~ 104% is predicted. This unphysical result is disconcerting to say the 
least! The flaw in our numerical exercise is that we have ignored corrections to the 
tree-level prediction of Eq. (3.2a). Ordinarily, one expects a ‘correction’ to be no 
more than a few tens of percent and usually much smaller. This case is not like 
that; it turns out that the most important correction is to replace the m}, factor by 
the squared running mass my (u) with u = My, 


z2 
(M 
fee Ma — 4x3)" [1 ese se 


. t > (3:2b) 
where the O(a@;) correction is also displayed. For the b quark, we have already 
found below Eq. (XIV-1.12) that m,(My) ~ 0.665 m, (mp), implying a corrected 
H — bb branching fraction of 56%. This means that all the remaining corrections 
for this mode amount to a rather more modest effect. The moral of this lesson is to 
not place unwarranted trust in tree-level estimates. 


Decay into three-body states: Although the Higgs boson couples to the electroweak 
gauge bosons, a Higgs with mass My ~ 126 GeV is too light to decay into WW 
and ZZ final states. However, a transition like H > WW* > Wf f’ (or H > 
Zf f) can occur, e.g., H > Wtdū or H —> W~cS and so on. We shall consider 
this possibility here. If dependence on fermion mass (such as m ¢/My or m ¢/My) 
is ignored, the energy distribution of the final state W is [KeM 84] 


ar 1 Mw \4 2 Aer 
H>WfF _ Ww (x4 — 4e*) 2 2 2 4 
—__—. —4 8 12"), 
dy = Ton? ( ) H (d—x)? (x e x +e“ + 12e ) 


(3.3) 


where x = 2Ew/My and € = My/Mz. Integration over the W-boson energy 
yields 
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1 {Mey 
(LO) = wW 
Dy wef ~ 49273 ( v ) Mu F(€) 
F( = 3(1 — 8e? + 2064) 3e? = 1 
95 Get — 1h 263 
47 3 1 
2 2 2 4 
(ae | Fe = +5 ]-30-6 +4e*)Ine. (3.4) 


Thus far, we have kept the final state fixed as Wf f’. To obtain the inclusive 
rate Ty w+x, we sum over all distinct final states (like the ones displayed above 
Eq. (3.3)) to find 


3 (Mw \* 

(LO) _ wW 

The case of H —> Zf f is obtained from the above relations via insertion of a 
factor nz = $ — £ sin? Oy, + 4% sinf 6,,. 


Decay into four-body states: The degrees of freedom appearing in Table XV-2 are 
those occurring at the primary vertex, at which the Higgs decay process begins. 
However, these are often not the final states which are actually detected. For exam- 
ple, the quark—antiquark states will hadronize into jets whereas the vector gauge 
bosons will quickly decay and be observed as four-fermion final states, e.g., as in 
final states containing leptons and antileptons. We do not display analytic formu- 
lae here for such modes, but numerical examples are displayed in Table XV-3. 
The leptons and neutrinos there are summed respectively over £ = e, u, Tt and 
WS Yes Vp Ves 


Decay into massless final-state particles: The general leading-order H > yy 
decay rate is given in Eq. (2.11). Approximating this with the W-boson and top- 
quark contributions gives 


2 


E Ai lw) + Neq; Ain) , (3.6) 


(Lo) n~ © 
H>yy ~ 95673 


where the quantities A;(xw) and Aj/2(x;) are the loop functions defined in 
Eq. (2.11), with arguments xw = M?,/(4M;j,) and x, = M?,/(4m?). Although the 


Table XV-3. Four-body Higgs branching fractions.“ 


(4444) (qqeve)? (qqvv) (qql*£-) CL) 
11.8 3.38 0.81 0.27 0.03 


“All branching fractions are in % and the value My = 126. GeV is assumed. 
Here, l= e fi. 
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top-quark contribution dominates that of the other fermions (due to its much larger 
Yukawa coupling to the Higgs), that of the W-boson is even larger, |N-q?A1/2(x1)/ 
Ai (xw)| = 0.22. 

For the transition H — gg, decay products would appear as jets consisting 
of light hadrons. The H — gg decay rate has already been given in Eq. (2.17). 
Approximating this with the dominant top-quark contribution in the heavy m, limit 
yields the tree-level expression, 

ao _ & My (3.7) 

A 88 72132" l 
Virtual gluon exchanges will modify the above. Unlike the case for H —> yy the 
next-to-leading-order H —> gg amplitude will experience gluon self-interactions 
such as triple-gluon vertices and turns out to have a large numerical effect 
[SpDGZ 95], 


LO 95 7 (LO) 
T H> gg ~ Sa f + (7 = g as (M7) tp ees > 1.64 Ds pes (3.8) 


with n ¢ = 5 and œs (Mp) given previously in Eq. (II—2.79). 


Production 


Next, we consider the most important of the mechanisms at LHC energies for 
producing the Higgs boson in the inclusive process p + p —> H + X, where X 
represents a sum over all the other final-state particles. The scattering which yields 
the Higgs production will involve the basic degrees of freedom (partons) occur- 
ring within a proton, the quarks and gluons. Because the partons are not physical 
entities, the cross section must be expressed as 


1 
o= D| dx dx2 fixi, Q)f;(%2, Q)Gi;, (3.9) 
i,j 


where the indices i, j refer to the two initial-state protons and the quantities f; 
and f; are parton distribution functions (‘PDFs’). A hadron’s PDF f(x, Q) gives 
the probability density for finding a parton carrying a fraction x of the hadronic 
longitudinal momentum at momentum reference scale Q. Given the difficulty pre- 
sented by nonperturbative QCD, a PDF is commonly inferred from experimental 
data, e.g., as with 


filx, Q) = Nx" (1 =a)" gi). (3.10) 


where œ;, p; are fit parameters. The function g; (x) is defined to approach constants 
atx = 0, 1 e.g., g)(x) = 146; fat Dix+Ejx? and itself contains the fit parameters 
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(a) (b) (c) (d) 
Fig. XV-3 Higgs production via: (a) gg fusion, (b) VBF, (c) HV, (d) tt H. 


€i, Di, Ei. The parton cross section Õij is calculated at leading order from various 
Standard Method processes and corrected by both QCD and EW perturbations. 

Within this phenomenological framework, one has at the Higgs mass scale and 
LHC energies the following Standard Model mechanisms, depicted to leading order 
in Figure XV-3 and listed here according to cross-section magnitude: 


(1) Gluon-gluon fusion (gg fusion): gg —> tt > H 

(2) Vector—boson fusion (VBF): qq —> qq + V*V* > qq+H 

(3) Vector—boson-associated production (HV): qq > V* > H +V 
(4) tt associated-production (tt H): gg —> tt + H, 


Numerical values [He et al. 13] for each of these contributions at the energies 
Js = 8, 14 TeV appear in Table XV—4. Table X V-4 contains not only cross-section 
values but also uncertainties for each, given numerically in per cent. These arise 
mainly from aspects of QCD, such as uncertainties in QCD parameters (e.g. as, 
Me, etc.), parton PDFs and a significant uncertainty from the uncalculated higher- 
order QCD corrections. 

The gluon—gluon fusion reaction proceeding via top-quark loops is the domi- 
nant component of the p + p —> H + X cross section.* It also has the interesting 
property of being sensitive to certain types of virtual heavy particles. We saw in 
the derivation of Eq. (2.20) that the top-quark contribution to the triangle graph for 
H — gg does not decouple, despite having 4m? >> M?,, because the coupling 


Table XV-4. Standard Model Higgs production cross sections.“ 


a/s (TeV) gg Fusion VBF HW HZ ttH 
8 18.97 C122) 1.57 (C934) 0.69 (+1.0%) 0.41 (3.2%) 0.19357) 
14 49.85132) SABC), ISOC) Deel) 0.618) 


“All cross sections are in pb units; the value My = 126 GeV is used for ./s = 8 (TeV) 
and My = 125 GeV for ./s = 14 (TeV). 


4 The next most important contribution, that of the b-quark loop, is estimated at leading order to be at most a 
10% effect. 
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between tt H is proportional to m,. Thus, what if there were a very heavy fourth 
generation of Standard Model fermions (a situation often denoted as SM4) with 
all else the same (i.e. same Standard Model couplings, only one physical Higgs 
boson) as with the known fermions? The new generation would contain two new, 
very heavy quarks, say u4, d4, which likewise would not decouple in the H —> gg 
vertex. The H — gg amplitude would then be about a factor three larger than 
in the Standard Model case, and the gluon—fusion production cross section about 
nine times as large. Moreover, using LHC and Tevatron data as input, it has been 
concluded from an analysis of Higgs decay modes that SM; is excluded at more 
than 50 [EDHLLMNW 12]. 

Earlier, in the discussion following Eq. (3.1), we pointed out that detection of 
final states like bb, gg, c¢ at the LHC, where the gg —> H is the dominant pro- 
duction mechanism, is greatly hindered by hadronic backgrounds. However, a bb 
final state can be relatively more accessible if the Higgs particle is predominantly 
produced in association with a vector boson (V = W, Z) or a tt pair, a strategy 
which has been pursued by the detectors CDF and DO (Tevatron) and ATLAS and 
CMS (LHC). This can lead to detection of H —> bb via more easily identifiable 
configurations like 


HW => bbew, HZ —> bbe, HW, HZ > Erbb 


where £ = e, u and Ær represents missing transverse energy. Some promising 
results have been obtained thus far, e.g., a reported excess of events at 3.lo with 
My = 125. GeV [Aa et al. (CDF and DO Collabs.) 13] and a > 30 significance 
in the combined tt + bb channels reported by the CMS collaboration at the 2013 
Lepton—Photon Conference. 


Comparison of Standard Model expectations with LHC data 


Statistical data analyses have been performed to test the extent to which collected 
data agree with the Standard Model Higgs boson scenario. Such testing can be done 
directly by experimental collaboration or as a theoretically motivated exercise: 


(1) Experimental: One can define a global signal strength factor m; for a given 
final state ‘i’ by folding together the production cross section and branching 
fraction for the observed signal relative to the Standard Model prediction, 


pa Oj>H Bri. | 

É ob: 
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There is a label ‘j’ because a given final state ‘i’ might be summed over a 


Hi = (3.11) 


subset of Higgs production processes ‘i’. The value u = O corresponds to 
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the background-only hypothesis whereas u = 1 corresponds to the Standard 
Model Higgs boson signal in addition to the background. Announced results 
from the LHC detectors have been found, thus far, to be statistically consistent 
with the Standard Model hypothesis. 

Theoretical: There are a number of ways to parameterize couplings to include 
non-Standard Model behavior. Suppose Standard Model Higgs couplings to 
fermion f and to vector boson V are generalized to have the forms [ElY 12], 


2(1+e) 
_ fs mpy! _ My My 


where e and M are purely phenomenological parameters. In the Standard 
Model, they become e€ = 0 and M = v ~ 246 GeV. A global fit to LHC 
data yields results consistent with these values, € = 0.05 + 0.08 and M = 
241 + 18 GeV. 

Another procedure is to consider an effective lagrangian for the electroweak 
symmetry-breaking sector, which modifies couplings to vector mesons and 
fermions in terms of universal parameters ‘a’ and ‘c’. 


H = H 
Lam >> mM VV” [1 +202 -Y mf fi iter] +e 
V=W,Z K i “ 
(3.13) 


where nw = 1, nz = 1/2, and the ellipses represent a sum over all remaining 
Standard Model contributions as well as possible higher-order terms in the field 
variable H. In the Standard Model, we have a = c = 1. Fits to the current 
dataset again yield results consistent with Standard Model expectations [E1Y 
12, ESGMT 12]. 

The above parameterizations are just two examples of Higgs-related 
phenomenology. These tests, and others, will continue into the future as the 
Higgs database expands. 


XV-4 Higgs contributions to electroweak corrections 


Prior to the discovery of a new boson at the LHC, direct Higgs searches yielded 
only upper bounds, e.g., as with My < 114.4 GeV obtained at LEP2. How- 
ever, the calculation of quantum corrections to Standard Model predictions came 
to play a central role in particle phenomenology and Higgs physics in particular. 
The procedure is straightforward; a collection of observables (My, ...) is mea- 
sured and then compared to predictions expressed in terms of a set of input para- 
meters (G,,@,...) including the Higgs mass My (cf. Sect. XVI-6). Although 
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the dependence on Higgs mass in such analyses is somewhat weak, being loga- 
rithmic ~ In M?, at leading order, it has continued to show for quite some time 
that the Higgs boson is ‘light’. A recent Ax? fit gives [Ba et al. (Gfitter group) 
12] My = 94t GeV. That this value is consistent with the LHC determinations 
of My is generally regarded as a noteworthy success of the Standard Model. To 
observe the role of the Higgs boson in this procedure, let us next consider a few 
specific examples of such corrections. 


The corrections Ap and Ar 


Higgs contributions to Ap: We begin with the so-called effective weak mixing 
angle 


= 1— —2 4 Ap, (4.1) 


which is discussed at length in Sect. XVI-1. The corrections to 52, are contained 
within the quantity Ap. For arbitrary My, the one-loop Higgs contribution to Ap 
is 


1-loop 3 Mọ 2 2 
App = -i (5) f (Mi,/M3), (4.2a) 
where 
lnc? — lnx Inx 
fay =s] ok zil (4.2b) 
The leading dependence on My for My >> My is logarithmic, 
2 2 2 
Apy Te -Ż (5) Z In ve (4.3) 


as are all the other leading one-loop Higgs contributions.” A term like In Mj,/Mj, 
does not respond sensitively to changes in M7,, so the shift Apy by itself does 
not lead to a precise estimate for My. 

There are also multi-loop Higgs contributions. In contrast to the In M;,/M;, 
logarithmic dependence of the one-loop amplitude, these also contain power-law 
dependence on My, 


2 
ea ) saM (4.4a) 


2-loop 
Apr 0.1499 (Ss 2M,” 


5 It is, however, not the case that one-loop corrections for all the remaining Standard Model particles are 
logarithmic, e.g., Ap has a O(Gym?) dependence on the t-quark mass (viz. Sect. XVI-6). 
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Table XV-5. Higgs contribution to Ap. 


Order: One-loop Two-loop Three-loop 
—1.8 x 1073 8.1 x 1077 —6.2 x 1078 


and the three-loop amplitude gives 


. M2, \° s2 M4 
3-loop W wi H 
A ~ —1.728 ; 4.4b 
PH (5) c2 M4, ( ) 
Observe that common to all terms is the coefficient, 
Mj, 
Da 0.0027. (4.5) 


An extra power of this small quantity will accompany each additional loop and thus 
suppress the multi-loop contributions, at least for moderate values of My. Note 
also that the two-loop and three-loop amplitudes have opposite sign. The values of 
the one-loop, two-loop, and three-loop amplitudes are summarized in Table XV—5 
using My = 126. GeV. The one-loop amplitude is dominant and gives an accurate 
estimate of the Higgs contribution to Ap. 

Higgs contributions to Ar: A second class of Standard Model corrections affects 
the relation between the Fermi constant and My, given to leading order by 
Eq. (II-3.43). Upon using Eq. (II-3.42) and Eq. (II-3.33), we can express this as 


M? TA 
M? ( — ze) = ; (4.6) 
. MZ) 2G, 


The one-loop Higgs correction to this relation, 


M? 3 
M2, (1 w) = (1 + Arh ae) (4.7) 


MES > OG), 
is given by 
: 11 M? M, 5 
1-loop W H 
Ary = . (i — J (4.8) 
48m? v? My, 6 
Custodial symmetry 


As part of our discussion of chiral symmetry in Chap. IV, we obtained a repre- 
sentation of the linear sigma model by expressing an SU (2); x SU(2)pr invariant 
lagrangian (cf. Eq. ([V—1.4)) in terms of two chiral fermions yz, Wr and a2 x 2 
matrix & = ø + it - x of four scalar fields. The SU (2); x SU(2)R transformation 
properties were Y > LY, Wr > Rvp and È > LDR’ with L, R in SU(2). 
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Somewhat analogously, we can express the Higgs doublet as a matrix H via the 
construction® 


1 ~ 1 g% a) 
V2 ( ) fe Ce g? =A 


where ® is the Higgs field of Eq. (II-3.16) and ® is its conjugate.’ This will be 
convenient for considering transformations of both SU (2); and SU (2)r. 

Even though this chapter is, for the most part, a discussion/celebration of the 
M ~ 125 GeV particle, which could well be the Standard Model Higgs boson, 
we shall, for the remainder of this section, instead emphasize the symmetry aspect 
of the Higgs sector. In the notation introduced above, a Higgs lagrangian invariant 
under SU (2); x U(1)y gauge symmetry is 


Luiiges = Tr [(D“H)* D,,H] — V HH), (4.10a) 


where the covariant derivative is 


D,H = (8, + iS But + igo .W,)H, (4.10b) 
and the potential has the form 
V (HH) = -u° Tr [HH] + à (Tr [H*H])’. (4.10c) 


The matrix tz in Eq. (4.10b) accounts for the opposite relative weak hypercharge 
of ® and its conjugate ®. That the lagrangian Lhiggs of Eq. (4.10a) is indeed gauge- 
invariant can be verified by noting 


SUQ), : H > LH and D,H > L(D,H) U(Q)y: H> He., (4.11) 


Actually, the potential energy V(H'H) of Eq. (4.10c) (but not the kinetic part in 
Eq. (4.10a)) is invariant under the larger set of SU(2); x SU (2) p transformations. 

Thus far, we have simply used a new notation to reproduce what we already 
know. In order to learn something new, however, consider the limit g; — 0. There 
is now present the symmetry, SU (2), under which 


SU(2)r: H —> HR’ and D,H > (D,B) Rİ. (4.12) 


Thus, for the combined SU (2); x SU (2)p transformations, we have H > LHR’, 
like the sigma model matrix & mentioned at the beginning of this section. Although 
true, the above analysis is incomplete; we must address the Higgs spontaneous 


6 In the following, we adopt the general approach of [SiS VZ 80] and [Wi 04]. 
7 Tn the language of group theory, the conjugate spinor ® = it7 ®* is equivalent to ®. 
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symmetry breaking of Eq. (II-3.25), for which the ground-state configuration of H 


becomes 
Lee o0 
H) = = 4.1 
(H) = 5 o ,) (4.13) 
with v = (u? /à)!2 as in Eq. (II-3.24). Although this ground state does not respect 
the full SU (2); x SU(2)r symmetry, 


L(H) # (B), (H) R* # (H), (4.14a) 


it does remain invariant under SU(2);4, transformations, i.e., those having 
REL; 


L(H) LÝ = (B). (4.14b) 


This SU (2)z+pr invariance is often referred to as custodial symmetry [SiSVZ 80]. 

In Chap. II, the basis of our discussion of the electroweak sector was the Higgs 
effect, i.e., the spontaneous breaking of the gauge symmetry SU(2); x U(1)y. 
Here, let us instead use elementary group theory to see what the g; = O world, 
with its exact SU (2). global symmetry, would be like.* Eq. (II-3.31) shows that 
setting g; = 0 would cause the weak mixing angle to vanish, 0w — 0, and so from 
Eq. (II-3.30) for Z? > W3. 

It follows from Eq. (I-5.17) that the three W-boson fields would transform 
as an isotriplet under the (global!) SU (2); transformations, and as an isosinglet 
under SU(2)r (since g} = 0). They would thus transform as an isotriplet under 
SU (2)z+r and, since the SU(2);42 symmetry is exact, the W triplet would be 
degenerate. The above remarks imply the equality 


p = (Mw/(Mzcos w)? =1 (in the gı — O limit). (4.15a) 


When viewed as a statement of invariance, this equality is a consequence of the 
SU (2)L+r symmetry, which is called ‘custodial’ for this reason. As we then return 
to the real world of g, Æ 0 and allow for higher-order Standard Model corrections, 
we would expect corrections to p = 1 to be modest [SiSVZ 80], 


p=1+0(@) + Ola (m? — m?) /M3,). (4.15b) 


XV-5 The quantum Higgs potential and vacuum stability 


Our treatment of the Higgs potential has thus far been at the classical level. We have 
simply taken the quadratic and quartic terms that appear in the bare lagrangian, 


8 For example, the electric charge would vanish (cf. Eq. (II-3.42)), so modest mass shifts would occur, e.g., the 
leading-order contribution to pion mass splitting would vanish, etc. 
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minimized the energy, and found the vacuum expectation value and the Higgs 
mass. However, quantum effects modify this form significantly, most importantly 
through a top-quark loop. Even more remarkably, the presently indicated value of 
the Higgs and top masses indicate that we are very close to the border where the 
Higgs potential is actually unstable. In this section, we explore the nature of the 
quantum effects. Our focus is on the role of the top quark, which is the major 
contributor to the potential instability. 

The Higgs potential describes the vacuum energy as a function of a constant 
Higgs field. Since the top-quark mass and the Higgs Yukawa coupling to the top 
quark are related, it is convenient to define a background field h(x) = v + H(x). 
In the following we take H (and hence A) as constant and thus omit any spacetime 
dependence, 


r; = Tr; = = 
—£L,= — (v + A)tt = —htt =m, (Att, (5.1) 
t V2 V2 t 
where m,(h) = T,h/ /2 is the field-dependent mass. We then calculate the vacuum 
energy as a function of m,(h). This can be done relatively simply by studying the 
tt contribution to the vacuum matrix element of the energy-momentum tensor Tyv, 


OTa = Ne f SP, imh + i 
(O| Tul )top = c Gry 2 Yu Pv PP amre 


_ 12 f d’ p i 
= Onyi PP” p mh) + ie 
= 8V (h)8guv» (ae 


where the important minus sign comes from the Feynman rule for a closed fermion 
loop. This leads to a result 


4h 2 2(h 
| = 7 + nd -n 4 3] 


8V (h) = —— 
4d we 2 


(5.3) 


with ua being the scale that enters in dimensionally regularized integrals.” The 
divergence is proportional to m*(h) ~ h* and thus goes into the renormalization 
of the Ag* term in the Higgs potential. In the MS scheme, one then arrives at the 
potential, 


1 


va) =-; 


4 2 
3m} (h) [m m; (h) _ | (5.4) 


yh? + ai= 
4 16r? R 2 


The —m? (h) In m? (h) ~ —h*Inh? term from the loop diagram is the key new 
feature. 


9 Tn this context we add the subscript to (zg to avoid confusion with the — py term in Higgs potential. 
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We note that the logarithmic term produces an instability for large enough values 
of the field h. No matter what the coefficient A(z) of the h* term is, the —h* In h? 
term eventually will overpower it and lead to a potential that is unbounded below at 
large enough values of h. However, New Physics (NP) beyond the Standard Model 
could modify this result, for example by generating an effective operator 


-Lre = aa (v'o) = oh. (5.5) 
At the very least, such effects should be generated at the Planck scale A ~ Mp, 
so that we should not be concerned if the apparent instability occurs beyond the 
Planck scale. However, if the instability occurs at a lower scale, it implies either 
that the vacuum is at best meta-stable — a very dramatic conclusion — or that other 
New Physics must come in before the Planck scale — also important. 

To use the quantum effective potential, one minimizes the energy with the va- 
cuum expectation value constrained to equal 246 GeV and the top-quark mass 
equal to its physical value, and determines the Higgs mass parameter from the 
quadratic term in the expansion. However, unlike at tree level, the curvature of the 
potential near the minimum does not give the physical Higgs mass. In order to get 
the Higgs pole mass one must include finite momentum effects from the vacuum 
polarization diagrams. 

Given the physical values of these parameters, indications are that the potential 
is close to being unstable below the Planck scale. A more detailed treatment must 
include the effects of the Higgs itself and of the other particles. The state of the art 
includes the inclusion of more loops and the use of running couplings [De et al. 
12]. Moreover, if the seesaw mechanism is at play for neutrino masses, the neutrino 
Yukawa couplings provide an extra unknown destabilizing influence [CaDIQ 00]. 
It remains very interesting that the parameters of the Standard Model place us 
so close to this prediction of an unstable Higgs potential, implying yet another 
suggestion of New Physics below the Planck scale. 


XV-6 Two Higgs doublets 


Earlier in this chapter we briefly discussed the issue of a very heavy fourth quark 
generation, assumed to otherwise resemble the observed three generations. On the 
one hand, it would introduce new particles and thus lie beyond the Standard Model; 
on the other, it would respect the twin pillars of gauge symmetry and spontaneous 
symmetry breaking of a scalar doublet on which the Standard Model is based. 
Here, we proceed analogously by briefly considering the replacement of a single 
Higgs doublet ® by two Higgs doublets (®;, 2) having the same SU (2) @ U(1) 
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quantum numbers.!°? A two-Higgs-doublet theory would enlarge the spectrum of 
Higgs bosons and also considerably enrich the content of the Higgs potential. 


Spectrum: Since each Higgs doublet corresponds to four real fields as in Eq. (1.2), 
then two Higgs doublets will amount to eight real fields. Of these, three will become 
the longitudinal degrees of freedom of the Z° and W* gauge bosons. There will 
also be five spinless Higgs particles: a charged pair (H~), two CP = +1 neutrals 
(H, h), and one CP = —1 neutral (A). If we associate H with the Higgs boson 
of the one-doublet theory, then the two-doublet model predicts the four new parti- 
cles h, A, H~. At present, there is no experimental evidence for any of these four. 
Current lower-mass bounds are in the range of roughly 100 GeV for each [RPP 12]. 

Consider, for example, charged Higgs particles [Le 73] whose rich phenomeno- 
logy was realized early on [DoL 79, GoY 79]. The H= particles can be sought 
directly or indirectly: 


(1) Direct: Charged Higgs-pair production, e~e~ — H+ H™~ would arise via H* 
coupling to photons and Z° bosons. A charged Higgs could also couple semi- 
weakly to the known fermions with strength proportional to the fermion mass. 
Thus, at the LHC, a study [Aa et al. (ATLAS collab.) 13a] of gg —> tt 
followed by a decay chain such as 


t — Htb>csb and ft—> H7 b—>csb 


has yielded sharp upper limits on Gr;,47+, for the mass range 90 < My 
(GeV) < 150. 


(2) Indirect: A charged Higgs could contribute as a virtual particle, as with the 
leptonic decay of a B meson, 


2 
2 
(SM) 29 MR 
BrB+ ttv = Br 5+ set 1 — tan B M2 E 
H= 


where tan B = (o8) /(@°). 
Higgs potential: The Standard Model Higgs potential energy of Eq. (II-3.19) 
is based on one quadratic mass term and one quartic Higgs self-coupling. The 
most general renormalizable SU (2) ® U(1) two-Higgs-doublet version has three 


quadratic mass terms and seven quartic Higgs self-couplings, 


10 The possibility of two Higgs-doublets is usually associated with supersymmetry, but this is not necessary. 


456 The Higgs boson 
Vo-Higes = mj, PÍO, + m3, 0, — [mh Pi D + hic.] 


À . À 4 
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i às (ate \2 
+ A3(®} 1) (Did) +a (Pid) (Pid) + E (ojo) + he. 
+ [Ac] + h.c.]b) d + [A7H} d + hic.]O}o>. (6.1) 


Since this most general structure has the potential to produce overly large flavor- 
changing neutral currents (FCNCs) or gross violations of custodial symmetry, it 
cannot be realized in Nature without restrictions on the ten free parameters. A great 
deal of research on V2-Higgs has been reported in the literature; two recent works cit- 
ing many earlier contributions are [MaM 10] and [HaO 11]. Two additional items of 
interest deserve mention. One is that the above potential energy allows for CP vio- 
lation. A careful discussion appears in Chapter 22 of [BrLS 99]. Another involves 
the vexing strong CP problem of QCD. It has been shown that introduction of a 
“‘Peccei—Quinn’ global U (1)po symmetry [PeQ 77], which becomes spontaneously 
broken, can lead to a solution of the problem. The two-Higgs framework provides 
a natural platform for the U (1) po symmetry. 


Problems 


(1) The rho parameter 
(a) Show that for an arbitrary number of Higgs multiplets ((g;)o 4 0, 
(i = 1,...)), the rho parameter becomes 


DUW; + Uw) — Gna) KAN 
2 D; Unadi (Piro 
(b) Given two Higgs fields, with quantum numbers Iw = —Jy3 = 1/2 and 
Iw = 1, Iw3 = 0 respectively, and with nonvanishing vacuum expectation 
values (~1/2) and (91), obtain a bound for |(91)/(91/2)| assuming an exper- 
imental value pọ = 1.0004 + 0.0003. 
(2) Higgs-gluon coupling 
In the text we used the background field method to show that, at lowest order 
in the momenta, the effective Higgs coupling to gluons is 


ds h? 
Le ——|1 ee aoe 
f 240 (5 ) 
with h = v + H. As mentioned briefly in the text, this coupling implies a 


cancelation in the Standard Model prediction for the reaction in which two 
gluons produce two Higgs bosons, which makes the residual effect small. 


(3 


wa 
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In addition to the direct coupling from the above effective lagrangian, there 
is a pole diagram of GG — H — HH, which utilizes the triple Higgs cou- 
pling. Show that these two contributions cancel exactly at threshold. 

Higgs sector and the cosmological constant 

The Higgs sector makes several contributions to the cosmological constant, A, 
which is defined as the energy density of the vacuum. The observed value of 
the cosmological constant is A = Uyas = 2.8 x 1074 GeV‘. In Eq. (2.9) 
we displayed one contribution that is 51 orders of magnitude larger than the 
observed value. Other calculable contributions also come from the Higgs sec- 
tor. For example, show that if one changes the up-quark Yukawa coupling by 
a few parts in 107%, one changes the cosmological constant by 100%. The 
leading change is linear in the Yukawa coupling, and to uncover this you may 
use the effective lagrangians of Chap. VII. Specifically, compare the Yukawa 
coupling’s effect on the vacuum expectation value of the lagrangian to the con- 
tribution of the Yukawa coupling to the mass of the pion, expressing the result 
in terms of Fz, My and ratios of the quark masses. 


XVI 


The electroweak sector 


Early studies of the weak interactions were confined to processes, like nuclear beta 
decay and muon decay, which concern just the charged weak current. Starting from 
the mid-1970s, the field of weak interaction phenomenology was broadened by 
experiments involving neutral weak currents. The advent of collider experiments 
made possible direct studies of the W* and Z° gauge bosons themselves. This 
chapter will first address the topic of low-energy neutral-current phenomenology 


and then consider physical processes at the higher mass scales Mw and Mz. To 
conclude, we turn to the more theoretical topic of electroweak radiative corrections 
and renormalization. Throughout, we shall keep our treatment at a relatively simple 
introductory level. 


XVI-1 Neutral weak phenomena at low energy 


The words ‘low energy’ in the title of this section denote processes with Q? « M A 
We shall focus on three of these: 


(1) deep-inelastic neutrino scattering (DIvS), 
(2) atomic parity violation (APV), 
(3) parity-violating (PV) Møller scattering. 


In each case, the main finding is a determination of the weak mixing angle at the 

kinematical scale u = Q appropriate to that experiment. In this context, it is con- 
venient to use a scale-dependent version of the weak mixing angle, such as the 
MS quantity $2 (u).! Then, we display in Fig. XVI-1 the dependence of $2 on Q? 
found from both low-energy and high-energy studies. Fig. XVI-1, although not 
yet reaching the iconic status of Fig. H—6 (which displays the asymptotic freedom 
! We employ the common abbreviations sy = sin Oy, Cy = cos Ow and also employ sy for MS 


renormalization. For convenience, we shall refer (admittedly loosely) to the quantity s2, as the ‘weak 
mixing angle’. 
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Fig. XVI-1 Scale dependence of $2, from [RPP 12] (used with permission). 


property of QCD), has become an apt representation of this field. It has, indeed, 
been a major achievement of low-energy neutral-current studies to verify (within 
experimental uncertainties) the variation of So (u) with scale jz expected from the 
Standard Model. 

One can use the renormalization group to ‘run’ each of the low-energy deter- 
minations up to a standard reference scale, say u = Mz, to provide the values 
for 52(M z) shown in Table XVI-1 [KuMMS 13]. For comparison’s sake is also 
included the quantity §2(Mz) obtained by using an average of data from experi- 
ments carried out directly at the Z° scale, e.g., Z° decays and cross-section asym- 
metries, cf. Sect. XVI-2. At present, the high-energy determination is far more 
accurate than the low-energy determination due to its dominance in statistics. 


Neutral-current effective lagrangians 


To provide a theoretical language for such low-energy experiments, let us identify 
effective lagrangians for some neutral-current processes. Recall from Eq. (II-3.40) 
that the neutral weak interaction between the gauge boson Z° and a fermion f is 
given at tree level by 


Table XVI-1. Weak mixing angle from neutral-current 


experiments 
Experiment (0% (GeV?) $2 (M2) 
DIvS 20 0.2356(16) 
APV (in Cs) 5.8 x 1076 0.2383 (20) 
PV Møller 2.6 x 107? 0.2329(13) 


Average at Z mass scale M3 ~ 8.3 x 10% 0.23125(016) 
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(f) 82,0 (f) (f) 
Latl-wk = — Jer Zad (ey oY” + 8, 0V Hys) f 
w,0 
(f) (f) (f) (f) (f) 
80 = Tw T 25° 50. , 820 = Ty3 > (1.1) 


where we denote tree-level parameters with a ‘0° subscript. Examples of individual 
Be and ov } appear in Eq. (II-3.41). To describe neutral-current interactions at 
low energies, one forms an effective four-fermion lagrangian, akin to the Fermi 
model of charged-current interactions. At tree level, the Z°-mediated interaction in 


the low-energy limit is 


1 go 1 
L= -> DF (star" + gov" ys) f = 


( 
ai (90% + a Yrs) f 
Z,0 
ES F (ody + gcoy"ys \f Fie, o Yu + go vays) T (1.2) 
FT 


res 
ne 
where po is the tree-level rho parameter, 


1 Mio 


cena 1.3 
o M2. oa 


po = 
Comparison of the second of the relations in Eq. (1.2) with Eq. (V—2.1) shows that 
po governs the relative strengths of the neutral and charged weak-current effective 
lagrangians. In the Standard Model, it has the tree-level value unity, p N M — 1. The 
reader might wonder — why include a quantity, pọ, whose Standard “Model value 
is unity? There are actually two reasons: (i) although pe™ )=1, po is not unity in 
general, e.g., alternative choices for Higgs structure can lead to different values for 
po (cf. Prob. XV—1), and (ii) even in the Standard Model, electroweak corrections 
will change its value away from unity (cf. Sect. XVI-6). 
The set of low-energy neutral-current processes includes neutrino-—electron, 
neutrino—quark, and parity-violating electron—quark interactions. There is an effec- 
tive lagrangian for each of these, two examples being 


Gu 7 a) — 
Lig = -5 Tey” (l + ys)ve le Tua Yu l + V5)qa + Gey. — Ys)qa | 


G 2 _ = _ 
LO) = ——E [Cley"yse Ga ¥nda + Cleve Ga ¥n Ysa , (1.4) 
af 2 
where the index «œ = u, d, ... denotes quark flavor. Of course, contributions other 


than neutral weak effects also enter, e.g., parity-conserving eq scattering experi- 
ences the electromagnetic interaction. 

In Eq. (1.4), we have implicitly included the effect of radiative corrections and 
thus omit the subscript ‘0’. Table XVI-2 gives a compilation of the radiatively 
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Table XVI-2. Radiatively corrected coefficients. 


Coefficient General form“ 

en Pvn (G — an) 
d 

g PvN -4 + T a 


d 
0 PvN Dar i 
ci Peq (-3 + Kegs) 
ci Peq (3 = 3KeqS ) 


“Small additive terms are omitted. 


corrected coefficients (with renormalization scheme left unspecified). The quan- 
tities p; and x; in Table XVI-2 reduce at tree level to unity, p;o = Kio = 1. The 
pi are overall multiplicative factors and the k; multiply the weak mixing angle, 
which itself has become renormalized, ae — s2. The presence of such quantities 
in the effective lagrangians can be traced back to the underlying neutral-current 
couplings, 


) ) ( 
ilo > 8 = SBF (E — 27500). 2 > 8 = oF TAP. 1S) 


where, again, we leave the renormalization scheme unspecified. However, see 
[MaS 80] for the introduction of MS renormalization to electroweak corrections. 
The quantities p; and x; will be discussed in more detail later in Sect. XVI-6. 


Deep-inelastic neutrino scattering from isoscalar targets 


In deep-inelastic scattering, one measures the ratios of neutral to charged-current 
neutrino/antineutrino cross sections, 


R, P = OW F /ON ’ R5 = OM deere (1.6) 


Under the conditions of ‘deep-inelastic’ kinematics ([BaP 87]), theoretical calcula- 
tions of R, and R; are carried out in terms of quark, rather than hadronic, degrees 
of freedom. It is plausible that by working with ratios like those in Eq. (1.6), theo- 
retical uncertainites associated with hadron structure tend to cancel. At tree level, 
R, and R; are straightforwardly computed if scattering from an isoscalar target is 
assumed and antiquark contributions are ignored. It is useful to express the er 
coefficients of Eq. (1.4) as 
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2 2 1 5 
d 
g = (e) T Gi i SNT 9 (1+ 70) So 


7 2 2 5 
wh = (en) (L) = sin (1.7) 


These quantities can be determined from the combination of neutrino and anti- 
neutrino cross sections, 


Ry rR; 
Ry = =" = g2 + 2%, 1.8 
+ itr EL EER (1.8) 
where r=1/r = On fa are measurable quantities with tree-level values 


ro="o ' 3. The NuTeV experiment [Ze et al. 01] at Fermilab, carried out at an 
average momentum-squared transfer (Q?) = (—q?) ~ 20 GeV’, has yielded the 
most precise determination to date, 


g2 = 0.3005 0.0014, g% = 0.0310 0.0011. (1.9) 


This translates into a determination of the weak mixing angle, which lies nearly 
30 above the stated Standard Model prediction, a finding which has spurred much 
discussion since then. 


Atomic parity violation in cesium 


The Z°-mediated electron—nucleus interaction, expressed here in the electron spin 
space, contains a component which is parity-violating, 


G 
Henc(r) = F OwYsPouci(r), (1.10) 
where ys signals the presence of parity violation and pnuci(r) reminds us that the 
electron feels the effect only where the nuclear density is nonvanishing.” The quan- 
tity Qw is the ‘weak nuclear charge’ to which the electron couples, and is given to 
lowest order by 


Qw o(N, Z) = —2 (NuC tig + NaC{o) = Z (1 — 4550) +N, (1.11) 


where Z and N are, respectively, the nuclear proton and neutron number. The fact 
that So ~ 0.25 suppresses the proton contribution, leaving the coupling of the 
atomic electron to neutrons as dominant. 

Consider the effect in atomic cesium, !**Ce. Because of the neutral weak-current 
interaction, the single valence electron in cesium contains small admixtures of P 


wave in its 6S (ground) and 7S (excited) states. We write these mixed states as \6S) 


2 The abbreviation ‘PNC’ stands for parity nonconservation. 
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and |7S). As a consequence, there occurs a measurable parity-violating 75 —> 6S 
electric-dipole (E1) transition matrix element [NoMW 88], 

Im Epyc = (7S)|D|6S) = Ov konc, (1.12a) 
where D is the electric-dipole operator and 


N y [PEE tes 4 ae | 
Ow Eos — Enp Eqs — Enp 


n 


kpnc = 


(1.12b) 


The experiments involve finding the ratio of the PNC amplitude Epyc to the vec- 
tor transition probability 6. The most accurate results to date on the 6S —> 78S tran- 
sition are Epyc/B = 1.5935(56) mV cm~! [BeCMRTWW 97] and 8 = 26.957 (51) 
ap [BeW 99]. However, interpretation of the PNC measurements requires evalu- 
ating Eq. (1.12b) and this contains intractable aspects of the atomic many-body 
problem. There has, however, been recent progress [PoBD 09] and the latest cal- 
culation gives [DzBFR 12] Epnc = 0.08977(40)i(—Qw/N), implying the weak- 
charge value 


Ow 8 Ce) = —72.58(29) expt (32) thy, (1.13) 


where the uncertainties refer respectively to statistical and theoretical contribu- 
tions. This result lies about 1.50 beneath the Standard Model prediction QSM 
(8Ce) = —73.23(2). 


Polarized Moller scattering 


Another experiment which has probed the weak mixing angle at a low-energy 
scale is polarized Møller scattering [An et al. (SLAC E158 collab.) 05], where 
we remind the reader that Møller scattering is the elastic scattering of electrons on 
electrons. In SLAC E158, a50 GeV beam of longitudinally polarized electrons was 
scattered from an unpolarized fixed target. The parity-violating observable is the 
asymmetry 

OR — OF 


Apy = , 1.14 
Or+ OL ( ) 


where op) is the cross section for incident right (left) polarized electrons. Rel- 
evant kinematic variables are the center-of-mass-squared energy s= (p + p’)’, 
the momentum transfer Q? = —q? = —(p — p’)?, and the ratio of the two, y = 
Q?/s =(1 — cos 6)/2, where @ is the scattering angle in the center of mass. The 
experiment was carried out with average values (Q?) = 0.026 GeV? and (y) ~ 0.6; 
the tiny asymmetry Apv = —131(14)stat(10)sys x 107° was found. 


464 The electroweak sector 


There are three parts to the theory analysis. First is the tree-level amplitude, 
where the Møller scattering amplitude arises from t-channel and u-channel y and 
Z° exchange diagrams. Parity violation is due to the interference of the electromag- 
netic and weak neutral-current amplitudes. An approximate tree-level expression 
for Apy which is valid for the conditions of the E158 experiment is 


_ GQ? l-y 
Vna 1+yt+(1- y) 
The dependence on the weak mixing angle suppresses ASS) due to the proximity 
of sin? Ow o to 1/4. 

Next are the one-loop corrections, due mainly to the y—Z° propagator-mixing 
terms induced by fermion and W-boson loop amplitudes [CzM 96]. These are 
absorbed by the MS running weak mixing angle, 


52w) = (1 + Ak (u) $ (M2), (1.16a) 


(1 — 4 sin? Owo) . (1.15) 


(tree) 
Aw 


as parameterized by Ax (u). In particular, one finds Ax (0) ~ 0.03, so that 
1—4sin62 ~ 0.075 = 1 -«(0)52(Mz) ~ 0.046. (1.16b) 


The rather small (~3%) effect of Ax (0) translates into a major (~40%) change 
in 1—4sin62! Finally comes the renormalization-group-improved analysis 
[ErR-M 05]. This serves to ameliorate the dependence on large logarithms dom? / 
Q?) and In(Mj,/Q7) for fermion and W-boson loops respectively), which appear 
in the one-loop amplitudes. This results in the improved determinations, 


Ak (0) = 0.03232 + 0.00029, 52 (0) = 0.23867 + 0.00016. (1.17) 


XVI-2 Measurements at the Z’ mass scale 


The collection of resonances observed in ee collisions as a function of the total 
center-of-mass energy is displayed in Fig. XVI-2. At the Z? mass scale, it is the 
weak interaction which dominates the physics in this reaction, with strong and elec- 
tromagnetic effects merely supplying modest corrections. An enormous database 
has been established at the Z° factories with the LEP1 experiments at CERN and 
the SLD collaboration at SLAC. There also exists data from pp —> ft f~ mea- 
sured at the Tevatron as well as that from pp —> ¢*t¢~ + X taken by the LHC 
detectors. These experiments have come to be analyzed in terms of a so-called 
“effective description’ wherein the renormalized vector and axial-vector couplings 
of Eq. (1.5) are written as 


a? = Vor (TP- OP), MW =verT?, AD 
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Fig. XVI-2 Resonances in ee collisions. 


where the superbars denote evaluation in the effective renormalization scheme 
associated with the scale u = Mz. In this approach, the effective weak mixing angle 
Sp for fermion f is defined so as to absorb the «x ¢ factor in Eq. (1.5) [Sc et al. 06], 


57 = Kps, (2.2a) 


and can be measured experimentally by 
a 1 ~(f) )= 
FETUA R (2.2b) 


independent of the quantity « p.’ 


We shall discuss two kinds of measurements in the following: Z° decay into 


fermion—antifermion pairs, which is sensitive to (aif 2 + (el 2, and cross-section 


3 Although both p f and « ¢ are both generally complex-valued, we shall tacitly use just the real part without 
further comment. 
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asymmetries in the reaction e~e+ —> ff at energy /s=Mz, wee determine 
the ratios a ) / alt ? and thus lead to precise determinations of s 57 but not of the 
individual couplings themselves. However, between the two ads of experiments 
a full determination of the couplings becomes possible. A comprehensive review 
of Z°-related studies carried out at CERN and SLAC appears in [Sc et al. 06]. 


Decays of Z’ into fermion—-antifermion pairs 


Experiments at the LEP and SLD colliders have provided accurate determinations 
of the Z° mass and decay modes [Sc et al. 06]. To lowest order, the decay of a Z° 
boson into a fermion—antifermion pair f f can be conveniently expressed as 


1/2 _ 
LYP = (V2GpM2) Z“ Fry (8P +8975) S, 23) 


where f =u,d, ve, e, .... Upon defining y = m? /M3 for fermion mass m, we 
obtain for the lowest-order transition rate to a pair f f, 


N: G, M3 g — 2g” 
ro 2 20) J, ody | py ee 
zos en Ja S A] y gi? 4 g 
Ne GuMz (P2 gu 
= oe Ee D2 4 gl)2) | 2.4a 
Ser ee Aa 


where N. =3 if f is a quark and unity otherwise. If the final-state fermions are 
quarks, QCD-radiative corrections modify Eq. (2.4a) by a multiplicative factor 


ôocD, 
s (My s (M3, 
soco = 1+ SAME) 4 144 (S0 5 e, fie œ 1.04, (2.4b) 


where a, (M 7) ~ 0.12 has been used in the above. 

There exist also electroweak radiative effects, which we can take into account 
by employing the effective weak coupling constants al! and g a? of Eq. (2.1). 
Upon including both strong and electroweak corrections, the tree- level relation of 
Eq. (2.4a) is replaced (shown here in the limit of massless final-state fermions) by 


Ne Gy M3 
P70, ¢7 = Nie a V2 Z (gl 


where 7 ¢ = dgcp if f is a quark and 7 ¢ = 1 otherwise. 

Some Z? decay-related quantities are listed in Table XVI-3. These results are 
taken from [Sc et al. 06], but many others are provided in this source. Although 
there will be some adjustments from more recent studies (e.g. see [RPP 12]), the 
overall picture provided by [Sc et al. 06] bears testimony to an impressive advance 


(2 4. g2), (2.5) 
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Table XVI-3. Z° decay [Sc et al. 06]. 


Measurable“ Experiment Standard Model prediction 
lez 83.9 + 0.1 84.00 + 0.01 
Finy 496.2 + 8.8 501.66 + 0.03 

r5 377.3 + 0.3 375.98 + 0.03 

Ttot 2495.2+2.3 2496.0 + 0.2 

a? 0.0012 + 0.0003 0.0011 > 0.0013 
gi? 0.2492 + 0.0012 0.2513 > 0.2518 


“Decay widths are expressed in units of MeV. 


in particle physics. One application, among many, of the Z° decays is to use lep- 
tonic modes to test the concept of lepton universality, and one finds 

Pup Vy 

= 1.0009 + 0.0027, 


ee ee 


= 1.0021 + 0.0030, (2.6) 


which is seen to be consistent with universality. 


Asymmetries at the Z? peak 


For the reaction e~e+ —> f f carried out at the Z? peak a natural variable is the 
asymmetry parameter for fermion f, 
=P) -(f) =F) ;-(f) 
Pee poy 9 oe a7 
f= o 2,- i (2T) 
5 / (f)2 ,-(f)2 
8v + 8a 1 + (a /2a ) 


which can be determined experimentally from angular distribution and/or polar- 
ization data, as discussed below. In the case that the final-state fermion f is a 
quark q, then it is hadrons which are detected and the final-state hadronic charge 
asymmetry which is measured. It is to be understood that the measured data have 
been corrected for contributions such as initial-state QED corrections, y exchange, 
y—Z° interference, etc., leaving asymmetries which are purely electroweak in ori- 
gin. Finally, let the incident electron beam carry a polarization P, but the positron 
beam be unpolarized. For LEP1, the incident electron beam is unpolarized (P, = 0), 
whereas for SLC one has partial polarization (P; ~ 0.75). 
In the following, the symbols op and og refer to 


1 0 

d 

OF sár] d cos 0 A OB =2n | d cos 0 aai (2.8) 
0 dX2 1 dQ 


and oz, Or denote the cross section for an incident left-handed and right-handed 
polarized electron. Then three types of asymmetry are: 
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Arg = ee [forward—backward], (2.9a) 
Or +0B 
_ OL—OR 
ALR= [left-right], (2.9b) 
OL t OR 


ALRFB = S — Or ~ OBdR [left-right forward-backward], (2.9c) 
(or — oB)L + (Or — OB)R 
and the relation of these to the asymmetry parameter of Eq. (2.7) is 


p = Aye A. + Pe 
1+A.P. 

Yet another approach is to exploit the fact that final-state tau leptons themselves 
carry a polarization P,, which affects the tau angular distribution as well as its FB 
asymmetry Pe. 
try parameters A, and .A, via the polarization measurements 


3 
Arr = AP. Ai = g^r (2.10) 


As such, the LEP1 experiments were able to extract the asymme- 


4 
A, = Py Ae = —3P rp: (2.11) 


The above set of asymmetries were the subject of much study for a number of 
years. One interesting example is the high-precision measurement of Azpg carried 
out by the SLD collaboration [Ab et al. (SLD collab.) 00]. The left-right asym- 
metry was measured from the e*e~ production cross section by counting (mainly) 
hadronic final states for each of the two longitudinal polarizations of the incident 
electron beam at energies near the Z° mass. Despite the emphasis on detecting 
final-state hadrons, this measurement actually probes the asymmetry parameter of 
the incident-state electrons, 


1 — 45? 


Arr = — t, 
"1 = 45? + 85 


(2.12) 
where we have assumed lepton universality in writing the weak mixing angle as 
57. The precision measurement of Azp then leads to the following determination 
of 57, 

A = 0.15138 + 0.00216, 5? = 0.23097 + 0.00027. (2.13) 


In summary, the collection of measurements taken at scale u = Mz has, on the 
whole, been in agreement with Standard Model expectations. 
Let us conclude by commenting on just a few topics: 


(1) Effective weak mixing angle sz: Adopting a Higgs mass value My = 125 GeV, 
the Standard Model prediction [ErS 13] s = 0.23158 is consistent with the 
experimental determination 57 = 0.23153 + 0.00016. 
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(2) Unresolved issue: A long-standing item is the roughly 30 difference between 
the two most precise individual measurements s;=0.23097 + 0.00027 
(via Arr from the SLD production cross sections discussed earlier) and 
s? = 0.23221 + 0.00029 (via Arg from the Z? — bb transition found at LEP). 
Despite much discussion, the issue remains unresolved. 

(3) Quantum corrections: The large collection of high-quality Z° data has pro- 
vided determinations which are sensitive to quantum corrections. For exam- 
ple, the result g® = —0.50125 + 0.00026 (found in part by assuming lepton 
universality) implies via Eq. (2.1) that pg = 1.005 + 0.001. This differs from 
the bare value p;""* = 1.000 by 5c and attests that quantum corrections have 
indeed been probed. Some even more impressive examples appear in Sect. I of 
[FeS 12]. 


Definitions of the weak mixing angle 


Thus far in this chapter, we have made reference to three different versions of (and 
notations for) the weak mixing angle, 


Effective : a On-shell : s2 MS : $2 (u). (2.14) 


Since there is, in principle, an unlimited number of renormalization prescriptions 
for a given quantity in quantum field theory, it is no surprise to come across the 
three above usages in the literature (several others, not covered here, also exist). 
Let us briefly consider their relation to each other, starting with the effective weak 
angle for a lepton £. 

Given the definition for 5? in Eq. (2.2b), it’s clear that this quantity is tied to the 
ratio g/g as measured at the scale u = Mz. The motivation for doing things 
this way is a matter of convenience for the massive experimental effort by the Z° 
factories — one reads off a basic quantity of the Standard Model directly in terms 
of Z°-related data. The current precise determination, given earlier and repeated 
here, of 57 =0.23153 + 0.00016 attests to the success achieved by the Z°-factory 
experimentalists in doing precision physics. 

We have already seen (cf. Sect. H—1) how modified minimal subtraction (MS) 
can be implemented in dimensional regularization for the electric charge e(q°), 
and one proceeds accordingly for the weak mixing angle 52 (q?) (or sw(q7)yqs) by 
adopting the scale-dependent definition [Ma 79, MaS 81], 


2 (2) = 5 ta (2.15) 


A fit to the current database yields the value appearing already in Table XVI-1, 
viz., $7 (Mz) = 0.23125 + 0.00016. 


470 The electroweak sector 


The effective and MS descriptions of the weak mixing angle can be related 
[GaS 94]. As pointed out in [GaS 94] there was, at the time of the LEP1 operation, 
‘considerable confusion among theorists and experimentalists alike as to the pre- 
cise conceptual and numerical relation between the two’. The analysis in [GaS 94] 
established that 


SZ = Reke(Mz) §2(Mz) ~ 1.0012 82 (Mz) = §2(Mz) + 0.0003. (2.16) 


This is in accord with the individual values for S; and 52 (M z) given above. 
Finally, the on-shell weak mixing angle is defined in terms of the physical gauge- 
boson masses, 


s? = 1 — Mî,/ M3. (2.17a) 


Thus, the on-shell weak mixing angle can be experimentally determined directly 
from Mw and Mz. Inserting the gauge-boson mass values from Table I-1 into 
Eq. (2.17a), one has 


Fa liii = 0.2229 + 0.0003. (2.17b) 
The current uncertainty in s? | My.M,? #0ut twice that in 57 and $2? (Mz), is due 
largely to the W~ mass uncertainty, 6 My = 15 MeV, compared to the much smaller 
ôMz =2.1 MeV. With the completion of the Tevatron data analysis, along with the 
resumption of LHC operations, the precision gap between the direct on-shell deter- 
mination and the alternative S: and sZ (Mz) schemes is expected to be narrowed. 
Even so, the fact that the on-shell scheme contains some relatively large O(G „m?) 
corrections (see Sect. XVI-6 for a discussion) not present in MS renormalization 
lessens its appeal for use in electroweak perturbation theory. 

Returning to the idea of scale-dependent (or running) quantities, consider the 
possibility of relating gauge coupling constants g, (k = 1, 2, 3) in the MS scheme 
at the Z° scale with those of a ‘grand unified’ theory defined at an energy Egur > 
Mz. The so-called GUT scale signals the existence of a gauge group undergoing 
spontaneous symmetry breaking to SU (3)e x SU(2), x U(1)y. The condition 


== 8 (E = Egur) (2.18) 


leads to a prediction [GeQW 74] for the weak mixing angle at the scale Egur. In the 
grand unified theory of SU(5) [GeG 74, La 81] and its supersymmetric extension 
(SU SY-SU (5)), the MS weak mixing angle obeys 


5° (Ecur) = 3/8. (2.19) 
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At the much lower energy scale u = Mz, this value is reduced by a calculable 
amount,* 
š 3 a. My 
a2 2 
Mz) =s, = =|1-C—In—4+---], 2.20 
Sy (Mz) = sy | on Me | (2.20) 


where & = &(Mz), My is the mass scale of the superheavy gauge bosons, and C 
is a constant which depends upon the number ny of Higgs doublets, 


110 — 
— (SU(5)) 
ae (SU SY—-SU (5)). 


The SU(5) extension of the Standard Model has ny =1, whereas the minimal 
supersymmetric model takes ny = 2. 

The ‘bare-bones’ SU (5) model turns out to be unacceptable. It is well known to 
give rise to an unacceptably short proton lifetime, and precision data indicate that 
the three coupling constants of the Standard Model disagree with a single unifi- 
cation point if evolved according to SU (5) [AmBF 91]. Interestingly, the SUSY 
extension improves matters in both respects. The rate at which 52 (u) ‘runs’ is 
decreased due to contributions from supersymmetric partners (‘sparticles’) of the 
known particles, and the unification scale is raised to a level (My ~ 10!° GeV) con- 
sistent with the observed proton stability. The unification condition of Eq. (2.18) 
is better satisfied. Studies continue on whether supersymmetry breaking yields 
insights regarding masses of the long-sought SUSY ‘sparticles’. 
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We shall return to issues regarding the weak mixing angle and its several definitions 
in Sect. XVI-4. Before that, however, we consider some aspects of W~ physics. 
The LEP2 (ete), the Tevatron (pp) and the LHC (pp), colliders have provided 
copious W~-related data. 


Decays of W* into fermions 


The decay of a W-boson into a lepton and neutrino pair ve is governed by the 
lagrangian,” 
depp _ _ £2 


Lier — r Wide" + ys)é + h.c. (3.1) 


i Actually, Eq. (2.20) represents a simplification in that (i) lowest-order estimates for the renormalization- 
group coefficients are employed, (ii) supersymmetry-breaking effects are ignored, and (iii) the fact that 
mt > Mz is also ignored. 

5 Although we shall denote tree-level decay widths, cross sections, etc. with a zero superscript in this section, 
for the sake of notational simplicity, we shall suppress the zero subscript for bare parameters. 
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It is a straightforward exercise to compute the tree-level decay width, 


2 
Mw x Xx 1G 
ee ae ee ee ee 2t 4? 2 
Wel 6x ( x) yt er J2 W> (3 ) 
where x = m? /M?, and we have employed Eq. (II-3.43). Including a small electro- 
weak correction, we have w_,z,, ~ 0.226 GeV. 
There exist also decays W —> Gq into quark modes (the superscripts i, 
j= 1,2,3 are generation labels), induced by the lagrangian 


g a 
i= -a WIV, Ty" + yq + he., (3.3) 


where V;; is a CKM matrix element, and the index k labels color. The lowest-order 
decay width for quark emission is 


—< x — 271/2 
(0) _ oe pee 
D T woga On p” |v; [1 —2(x +X) +(x- X) | 
xX+X (x — x)? 1 Gu s 
E 7 Mw | Vi 3.4 
i 2 2 | 0 On on V2 W | E ( ) 


where x, x are mass ratios defined as above, and we assume that all emitted quarks 
eventually convert to hadrons. Since the t quark is too massive to be a product of W 
decay, a sum over accessible quark flavors yields }`; j |V; |? = 2. For decay into 
quarks, these lowest-order partial decay widths are modified by dgcp, the QCD 
factor of Eq. (2.4b) introduced in our earlier discussion of Z° hadronic decays. 

If all final-state masses are ignored, the predicted total width for W~ decay into 
fermions is 


My (GeV) \? 
(tot) (had) (lept) w 
rE? = r + rE œ~ 2.093 GeV (a) 


80.385 oa) 


An average of data [RPP 12] yields the value ro = 2.085 + 0.042 GeV, which is 
consistent with the prediction of Eq. (3.5). The current experimental uncertainty far 
exceeds that from theory. In the limit of massless final-state particles, the branch- 
ing ratio for decay into a lepton pair €ve is (Br)e ~ 1/9 (C=e, u, Tt), while 
inclusive decay to a mode containing a positively charged quark q (q =u, c) gives 
(Br), = 1/3. 


Triple-gauge couplings 


The SU(2) x U(1) lagrangian of Eq. (II-3.10) and the SU(2) field strength tensor 
of Eq. (II-3.11) alert us that there will be trilinear and quadrilinear couplings of the 
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gauge bosons. We shall limit our discussion here to the so-called charged triple- 
gauge couplings (TGCs). Upon using Eq. (II-3.30) to replace the neutral gauge 
bosons B,,, w3 with the physical fields A,,, vine we can write an effective WWV 
(V = Z°, y) lagrangian as® 


Lwwv =igwwv fe (wi, w” = Wy W"") V” +Ky WIW, V” 
+i wi weve (3.6a) 
Mh, pu v A . 


where W,» = 0,,W,—0,W,, Vav = 0nV.—90,V,, and gwwy represents the coupling 
strengths 
&wwy = 7e, Lwwz = —e cot Oy. (3.6b) 


The above lagrangian is constrained to contain only terms which are invariant under 
charge-conjugation (C), parity (P), and SU(2) x U(1) gauge transformations. In 
the Standard Model, the individual couplings in Eq. (3.6a) become 


gr =l, ky = 1, Ay =0 (Vaz Vs (3.6c) 
and are consistent with the following constraint of gauge invariance, 
kz = gf — (K, — 1)tan’ 6, àz = oe (3.6d) 
A recent review of LEP experiments gives [Sc et al. 13] 
g7 = 0.9847) 0, ky = 0.982 + 0.042, à, = —0.022+0.019, (3.7) 


consistent with Standard Model expectations. 

We can read off static electromagnetic properties of the W boson upon taking 
V =y. The decomposition in Eq. (3.6a) allows for the existence of a magnetic 
dipole moment uw and an electric quadrupole moment qw, 


e e 
uw = IM (d +k, +Ay,), qw = “m — Ay), (3.8a) 
or to lowest order in the Standard Model (SM), 
uY = e/My, ge =e/M,. (3.8b) 


A number of experimental studies of the TGCs, especially data from the LEP2 
ete, the Tevatron pp, and the LHC pp colliders, has emphasized searching for 
anomalous TGCs, often expressed in terms of the five quantities, 


Agl =g -1, Axz=xkz—1, Ak,y=x,—-1, dz, Ay. (3.9) 


6 Unlike TGCs with two WF bosons, purely neutral gauge-boson vertices are not present at tree level in the 
Standard Model. 
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These constitute anomalous behavior in that they vanish for the Standard Model 
values in Eq. (3.6c). Currently, no experimental evidence exists for any of the 
anomalous TGCs, for example 


—0.038 < Az < +0.031, —0.111 < Ax, < 0.142 [CMS], 
—0.074 < àz < +0.073, —0.135 < Ak, < 0.190 [ATLAS]. (3.10) 


The status of recent bounds is indicated by the results the LHC detectors. ATLAS 
and CMS will be performing further studies at higher energies and, even lacking 
discovery of such effects, will supply ever more stringent bounds on anomalous 
behavior. 

We can expand the preceding discussion to incorporate possible violations of 
parity and charge-conjugation invariance, for which an appropriate effective 
lagrangian Lwwg which does just this is 


= ia eee AT ee ee 
Lwwv = gwwv i Wi Wve + te, Wow woe 
w 


+ 94 WÀ W, (OPV? + 8” VE) + gf EH” (Wia Ws — 3a W) - Ws) v|. 
(3.11) 


Here, K, and iy are P-violating but C-invariant, whereas g/ respects P but not 
C and gY respects neither P nor C. In particular, the W boson could itself have 
static properties which violate at least some of the discrete symmetries. For exam- 
ple, an electric dipole moment dw or magnetic quadrupole moment gw would be 
parameterized as 


é 


dw = 
W 2Mw 


~ ee ~ e m ~ 

(k, + hy) , eS a (K, = Ay) : (3.12) 
My 

Limits on the neutron electric dipole moment can be used to place a bound on 

the W electric dipole moment [MaQ 86], and an updated evaluation gives |dw| < 

5 x 10-7! e-cm. 


XVI-4 The quantum electroweak lagrangian 


In the following three sections, we shall give a simple description of how electro- 
weak radiative corrections are calculated. We begin by quantizing the classical 
electroweak lagrangian to obtain certain of its Feynman rules. We also expand on 
earlier comments made in Sect. XVI-1 regarding on-shell renormalization. 
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Classical electroweak theory of three fermion generations is defined by an 
SU(2), x U(1)y gauge-invariant lagrangian,’ 


LD = cob (fu) Wu, Bu, ® {ep}, 81, 82, À, v?) ; (4.1) 


where ® is the Higgs doublet and the collection {gf} of Higgs—fermion coupling 
constants is flavor-nondiagonal. With spontaneous symmetry breaking, all parti- 
cles but the photon become massive and diagonalization of the neutral gauge- 
boson mass matrix occurs in the basis of the photon A,, and massive gauge-boson 
Zi fields, as given at tree level by Eq. (II-3.30). In addition, diagonalization of 
the charged-fermion and neutrino mass matrices for the three-generation system 
involves additional mixing angles and phases. The physical degrees of freedom of 
the gauge and Higgs sectors become manifest in unitary gauge (cf. Sect. XV—1), 


LOY = cod (wt, 


z? sA Ho ; {my}, Mw, Mz, Mp, e), (4.2) 


Wr , 


where the fermion mixing parameters are included in the {m ;}. 


Gauge fixing and ghost fields in the electroweak sector 


The quantum electroweak lagrangian LSP will contain, in addition to the classical 
lagrangian of Eq. (4.1), both gauge-fixing and ghost-field contributions, 


cam) — ce A. Le f) + L, (4.3) 


Mixing between gauge fields and unphysical Higgs fields occurs in the covariant 
derivative of the Higgs doublet (cf. Eq. (II-3.18)),° 


D p+ 5e1B, Pe i)o 


iS K RIT (a )'z-W, PB+het---. (44) 


2 


One can arrange the gauge-fixing term to cancel such mixing contributions. 
Expressing the complex Higgs doublet in terms of the physical field Ho, unphysical 
fields x+, x3, and the vacuum expectation value v as 


1 J2X4. 

p = — f (4.5) 
V2 \Ho + ix3 +v 

we write the gauge-fixing contribution in the form, 


7 We have replaced the Higgs parameter u? by the equivalent quantity v?. 
8 Mixing also occurs, of course, between the neutral gauge fields B,,, Ww. 
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- 1 £4 g2 
cep = —— la, we — 
W 2 4 H + 2 X+ 
1 giga V ogv V 
3 W% — X3 ðu B” + x3). (4.6) 
“Ee; 2 -5 2 


It is not hard to see that cancelation of the unwanted Higgs—gauge mixing terms 
occurs for arbitrary values of the gauge-fixing parameters +30. Even with this 
cancelation, there remain in LE, 2 quadratic terms containing the unphysical Higgs 
fields, and such terms will contribute to the propagators of these fields. 

As explained in App. A-6, once the gauge fixing is specified as in Eq. (4.6), the 
structure of the Faddeev—Popov lagrangian LE of ghost fields is determined. For 
the electroweak sector, it turns out that there are four ghost fields, 


LEY = L (ew, cp). (4.7) 


These are associated with the four gauge fields W,,, B, which appear in the original 
SU (2) x U (l)y symmetric lagrangian. 


A subset of electroweak Feynman rules 


The full set of electroweak Feynman rules is rather lengthy and we refer the reader 

to the detailed discussions in [BöHS 86, AOHKKM 82] or to the summary in 

[Ho 90]. A few of the more useful rules, expressed in terms of bare parameters 
9 

are 


fermion W-boson vertex: 
e , , 

—i —— V; [yu + ys) a SORN (4.8a) 

W225. il 7 lus 


fermion Z-boson vertex: 


[yug + ela B a (4.8b) 


28 Cry 


9 For notational simplicity, we suppress the zero subscript in the following discussion. 
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W-boson propagator iD (q): 


| + Jug (l = E) | ZEE emems> u (4.8c) 


= i i 
q’ — Mi, +ie q? — &,Mi, + ie 


Z-boson propagator i D® (q): 


quq (l — &z) 
— VANNINA : 
| swt | u (480) 


> i 
q’ — M3 +ie 
unphysical charged Higgs propagator i A“ (q): 


j q 
—— ee eee (4.8e) 
q — E My tie 


In the above, (Vj;) is a matrix element for quark-mixing, ge A are given in Eq. (II- 
3.41), and £z is defined by expressing the gauge fixing in the form of Eq. (4.6) but 
using the physical neutral fields. 

As seen in Eqs. (4.8c), (4.8e), each boson propagator is explicitly gauge- 
dependent and, in particular, the propagator of the unphysical x; vanishes in the 
E, — oo limit of the unitary gauge. This is as expected, because only physical 
degrees of freedom appear in unitary gauge. In fact, the absence of unphysical 
degrees of freedom in unitary gauge would appear to be an appealing reason for 
carrying out the computation of radiative corrections in this gauge. However, there 
is a ‘hidden cost’. In unitary gauge, the W~ propagator of Eq. (4.8c) becomes 


= Ew F quq / My 


iDO 
uv (9) E ET 


unitary 


(4.9) 


and the high-energy behavior produced by the g,,q,/M a term makes this a ques- 
tionable choice for doing higher-order calculations. Instead, as the price for accept- 
able high-energy behavior, many opt to accept the presence of unphysical fields. 
One popular choice of gauge fixing is the ’t Hooft-Feynman gauge, defined by 
setting all the gauge-fixing parameters equal to unity, &; = 1. In this gauge, the 
lowest-order propagators for the physical gauge bosons and unphysical Higgs and 
ghost fields have poles at either Mj, or MZ. This condition can be maintained in 
higher orders by a suitable renormalization of the gauge-fixing parameters. 
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On-shell determination of electroweak parameters 


The topic of electroweak radiative corrections to Standard Model quantities has 
been well developed over many years of research and by now there exists an 
impressively large literature. To focus our attention, there is one aspect that we 
will particularly address in the following. Given that the largest mass parameter in 
the Standard Model is that of the top quark, a natural question regards the effect 
m; has on the set of electroweak corrections. The answer turns out to depend on 
the renormalization prescription followed, its effect being largest in the so-called 
on-shell scheme. 

Two sets of electroweak parameters appear in the classical lagrangians of 
Eqs. (4.1), (4.2), 


{gy}, 81, 82, À, V? (Eq. (4.1)), 


classical parameter sets = 
{ms}, Mw, Mz, My,e (Eq. (4.2)). 


Considered as bare (input) parameters to the quantum theory, these obey the simple 
tree-level relations 


2 2 
82.0 gio + 820 5 = Z 
Mw o = v> Marana a e3 = 810 + 820 
Efo 
Myo = Voy 2X ; m fo = Vo— =. (4.10) 
Á xa 


At this stage, there are several (equivalent) expressions for the bare weak mixing 
angle, e.g., 

2 2 
Myo 2 81,0 


l or se 
2 w,0 2 2 
MZ o 81,0 + 820 


a a (4.11) 
The second relation becomes Eq. (2.15) in the MS renormalization. 

Radiative corrections will generally modify tree-level relations and, as a result, 
necessitate a precise definition of the weak mixing angle. Following the analysis in 
[Si 80], let us compare the parameter subsets (21,0, 82.0, up) and (eo, Mw.o, Mz.o). 
Each of these bare quantities will experience a shift, 


81.0 = 81 — 581, 82,0 = 82 — 982, ug =v — bv’, 
eo = e — be, Myo = My +5My, M} =M; +8M}. (4.12) 


In on-shell renormalization, the theory is specified in terms of e, Mw, and Mz. 
Moreover, the following relations are arranged to hold order by order, 


2 


2 2 2 
+ 
agtt Maav, moolto) 


4 4 4 
These equations constrain the effects of radiative corrections upon the parameters. 


, (4.13) 
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By differentiating the three relations in Eq. (4.13), one finds after a modest amount 
of algebra the conditions, 


ôgI w 
z? 1 1 1 Mm, 
302 A 2 5m2 
2 Se Sey ie 
82 ee) 2 M3 
: fa g J? 
du“ sZ sz ôe” 
v2 e2 


Also in on-shell renormalization, one defines the weak mixing angle in terms of 
the masses Mw, Mz as in Eq. (2.17a). Since this relation is to be maintained to all 
orders, the bare value com will be modified by shifts in the W and Z masses, 


2 2 
mg M3 o M3 T 5M; 
ôM?, 8M2 
~s f — cot? Ow ( x- za) . (4.15) 
My Mz 


For any renormalizable field theory, it makes sense to express results in terms 
of the most accurately measured quantities available. Thus, it is preferable in the 
electroweak sector to replace Mw by G,, and work with a modified parameter set, 


œ~! = 137.035999173(35), 
Physical parameter set = } G, = 1.1663787(6) x 1075 GeV~’, (4.16) 
Mz = 91.1876(21) GeV. 


To accomplish this, the relationship G, = G,(a, Mw, Mz,...) can be used to 
replace My by G,. 
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It is is evident from Eq. (4.14) that the parameter shifts Se”, 5M 3 and 6M play an 
important role in the study of electroweak radiative corrections. We have already 
determined from our analysis of QED (cf. Eq. (II-1.30)) that 


2 
oe —T1(0), (5.1) 


ee 

where the photon vacuum polarization TI (q?) appears in Eq. (II-1.26). In this sec- 
tion, we shall compute the portion of Mj, and 6M arising from the fermionic va- 
cuum polarization contributions to the W* and Z° propagators. As a consequence, 
we shall be able to reveal the presence of propagator contributions which scale as 
O(G wm), 
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The charged gauge bosons W* 


The radiative correction experienced by a W~ gauge boson propagating at momen- 
tumq i di f a self- functi Ca 
q is expressed in terms of a self-energy function, IT\,(¢~), 


TIA", (4°) = Aww (9°) 8%” — Bw (97) aq”. (5.2) 


(For notational simplicity in this subsection we denote W and Z boson subscripts 
for the quantities TI“” , A, and B in terms of lower-case Roman indices.) Although 
a vector-boson propagator iD,,,(q) generally contains terms proportional to g,,, 
and to g,,qy, it will suffice to study just the g,,, part. As indicated at the end of 
Sect. II-3, the g,q, dependence is absent if the gauge boson couples to a con- 
served current or will give rise to suppressed contributions if the external parti- 
cles have small mass. Thus, we have for the W propagator in °t Hooft-Feynman 


gauge, 


—18 uy —ig uv —i gua : 2\ aß —iggv 
—iAww(q )g “Oo að 
q? — Mio q? — Myo Fou Mn 
=i v 
>? a — 2 
q — My o + Awwlq ) 
—i guv 


= ; 5.3 
q? — Mj, + Aww(q2) — Mi, To 


where we have substituted for the bare W mass using Eq. (4.12). 
Let us now calculate the loop contribution of a fermion—antifermion pair fi fr 
to the self-energy Aww(q?). We begin with 


oe ae 
—i TI (q?) = B Nfi h 
fih 
{or “(1 +5) y+ 5) —— 
x | ——Tr — 
Qa)? y y5 pom eg =e , 


where 1s p = Nel Vp pl? for the case when the fermions are quarks. Aside from 
the occurrence of the 1 + ys chiral factor and the nondegeneracy in fermion masses 
mı, m, the above Feynman integral is identical to the photon vacuum polariza- 
tion function of Eq. (II-1.20). It is thus straightforward to evaluate this quantity in 
dimensional regularization, and we find for the g% component, 
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; Ny, 7,85 2 y 


1 a ae 
-3 f dx xd -om C 
0 H 


1 2 a2] 
-F fmt +m’ [2 — Xin var] - f iin ZA 2), 

2 € 2 0 u? 
(5.5) 


where M? = mix + m3(1 — x). Since the W~ boson is an unstable particle with 
decay rate Tw, the function Aww(q°) is complex-valued, and we consider its real 
and imaginary parts separately. 

From Eq. (5.5), we see that Re Aww(q°) is divergent. One can construct a finite 
quantity Avil@) by defining the field renormalization, W, o = (Z5”)'/7W,,, and 
constraining 6Mj, and ô Zy to cancel the ultraviolet divergence in Re Aww(q?), 


Aww (1°) = Aww (47) — 6M3, + 8Z¥ (q? — M2). (5.6) 

It follows from Eq. (5.6) that the mass shift Mj, is fixed by 
5My = Re Aww (My), (5.7) 
and the fi fo contribution to the field renormalization, which ensures that Aww 


(M;, )=0is 


ZY Ifi hl = "n48 Ee inva |. (5.8) 


To obtain a relation for the imaginary part of the self-energy, we recall that insta- 
bility in a propagating state of mass M is described by the replacement M — 
M — iT /2. This produces the following modification of a propagator denominator, 

1 1 
=> 3 
q? — M? q? — M? +iMT 


(5.9) 


where we ignore the O(T°) term. Comparison with Eq. (5.5) then immediately 
yields 


Im Aww (Mj) = MyT w. (5.10) 


We can use Eq. (5.6) to check this relation by setting q? = Mj,. If, for simplicity, 
we neglect the masses of the fermion—antifermion pair fı f2, then the imaginary 
part comes from the logarithm contained in the first of the integrals in Eq. (5.5), 


m f aio = a (5.11) 


=f 2__ag2 
u? IE P=M\, 
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and we obtain 


(tb) 
Im [aww (Mi) | G,,M3, 
lasin = ie Rar (5.12) 


where we have substituted g =4,/2M2, G,,. This agrees with the results of our 
earlier decay width calculations for W decay in Sect. XVI-3. 


The neutral gauge bosons Z°, y 


The system of neutral gauge bosons is treated analogously to the charged case 
except that we must deal with a 2 x 2 propagator matrix, and the issue of parti- 
cle mixing arises. Although the neutral channel was already diagonalized at tree 
level (cf. Eq. (II-3.30)), interactions reintroduce nondiagonal propagator contri- 
butions at higher orders. The g,,, part of the neutral channel inverse propagator 
Diiu (47), diagonal at tree level, 


ntl]uv 
2 
= : q (0) 
Dono G =i ( 0 gt R (5.13) 
has the renormalized form, 
2 A 2 Â 2 
0)—1 = q + Ayy (q ) Ay: (q e) 
Dinev (1°) > Dinter (4°) = 8w ( Asla) =? — MZ tA) 


(5.14) 


Upon taking the inverse, we obtain for the individual neutral boson renormalized 
propagators, 


jor 
DÈ (P) = —— —— Ea 
q? + Avy (9?) - Az. (a) / (@? - M3 + Âz (@°)) 

uv 2 aight 
aa a arr ae ree OTE 
q? — M3 + Ân (4?) — & (0°) / (42 + 42, (@°)) 

i ig” Â, (4?) 
w (q?) = y . (5.15) 


[e + Ayy (9°) | |? = M3 T Ay (a?) | = Az (4?) 

Observe that there is indeed a particle-mixing propagator, D57, proportional to the 
the reduced self-energy Ayz (q7). It might appear from Eq. (5.15) that Z°-photon 
mixing gives rise to a photon mass contribution. However, one arranges as a renor- 


malization condition that Ae (0) =0, and the photon remains massless under elec- 
troweak radiative corrections. 
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If we consider only the vacuum-polarization loop contribution due to a fermion 
of mass m, we obtain for the Z° self-energy, 


$ 2 4 aß 
_iT1°? (q?) = N. =) J d'p N 1 
mg (a) =N C (27)* (p? — m?) ((p — q}? — m?) = 


where m is the fermion mass, Ne is a quark color factor, and 


Nw = gif Tr [y“ py’ (p = ar m yy? 
+ aT [ppp my] 6a 


The quantities in N°? are just those expected from the coupling of fermion f to 
the neutral weak current. We then obtain, using dimensional regularization, 


2 2;,,(f)2 (f)2 
Fy, 2 85Ne |24 (8&7 +g ) [2 y ne 
AYN?) = leat | 3 E 2 are 


1 2 — g2 — 
-3 f dx x(1 me hl oo 2) 
0 


u? 


ren We Li 
44m2gM2I= -4 in Jae — = J dx In 
€ 2 2 0 


m? — q?x(1 — x) }| 


u 
(5.18) 


It is also easy to demonstrate that the photon-Z° self-energy AŞ is propor- 
tional to AQS ) for the case of a charged-fermion loop contribution, 
CP) gr) 
Ay: (P) = =A; (0°), 5.19 
yz (4 ) eww Oy vy (q ) ( ) 


where Q is the electric charge of the fermion. 
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All electroweak amplitudes will be affected by radiative corrections. We have 
already pointed out our interest in potentially large contributions arising from the 
heavy masses m, and My. We shall find leading corrections which are quadratic 
in the top-quark mass (O(G yn)? To begin this section, we consider corrections 
to the coefficients ps and xp of Eq. (1.5), followed by an analysis of the quantum 
correction known as Ar, and finally the Z — bb vertex correction. A historical 
overview of electroweak corrections appears in [FeS 12], and a thorough state-of- 
the-art presentation is given by Erler and Langacker in [RPP 12]. 


10 Recall from Chap. XV that corrections at leading order are only logarithmic in the Higgs mass 
(O(In[M}, /MF))). 
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Fig. XVI-3 Top-quark corrections to the (a) W*, (b) Z° propagators. 


The O(G,m?) contribution to A, 


Contributions to pş and «+ can be classified as either independent of the external 
fermions (universal ) or explicitly dependent on the fermion flavor f (nonuniver- 
sal). Recalling that at tree level these quantities reduce to unity, we have 


pp =1+ Ap + (Ap) kp =1t Act (An, 61 


nonuniv? 


where Ap and Ax denote universal pieces. It should be apparent that W5- and 
Z°-propagator corrections, like those in Fig. XVI-3, occur independent of the 
external fermions and are thus ‘universal’. Nonuniversal effects have been found 
to be small (i.e. subdominant) except for the Z? —> bb vertex. The universal 
effects are of special interest because they turn out to be the primary source of 
O(G ym?) radiative corrections [Ve 77a, ChFH 78]. As such, in the following we 
shall approximate 


2 


G 
Ap = (Ap) +++: , Ak = 3 (AP) +, (6.2) 
where 
3G „m? m 2 
Ap), = —— ~ 0.00942 x (——__—_.) . 6.3 
P= ap j m N) (6.3) 


Observe in Eq. (6.2) that Ax is proportional to Ap. This is a result of the Standard 
Model; in general, these quantities are independent. 
The quantity Ap can be defined as a correction to the rho parameter of Eq. (1.3), 


— 1 Dz = 0) 
© Zo Dw(q? = 0) 


M3+5M3  (-M}—5MZ+A,,(0)) 


3 a (6.4) 


po + Ap = 


or 


Azz(0) — Aww(0) 
Ap = M2 — YE 7 (6.5) 
zZ W 
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Observe that Ap is finite since the singular terms in Eqs. (5.5), (5.18) cancel. If we 
set mı =m, and m = m, in Eq. (5.5) and include both ¢-quark and b-quark loops 
in Eq. (5.18), a simple calculation reveals that Ap = 0 in the limit that m, = mp. 
However, the leading term in the small m, limit gives 


2 Ne 1 r) 2 2 
(Ap); = a | dx [in -emm | + 
m My Jo u u 
2 2 
83N: mî 
= — 6.6 
64m? M3 | (65) 


Substitution of N.=3 and G,/ J/2= g /8M a yields the result shown in 
Eqs. (6.2), (6.3). 

This quadratic dependence on the heavy-top-quark mass is in striking contrast 
with the behavior observed for the photon self-energy (cf. Eq. (II-1.26)). In the 
heavy-fermion limit, the photon vacuum polarization exhibits instead the decoup- 
ling result O(m;’). The reason for this difference is that QED is a vector theory, 
whereas the charged and neutral weak interactions are chiral. Indeed, one can show 
(cf. Prob. XVI—2) that the decoupling expected of a vector interaction results when 
left-handed and right-handed self-energies are averaged. However, equally impor- 
tant is the fact that as m, grows while m, is kept fixed, the weak doublet is being 
split in mass. Thus, decoupling of the top quark in the large m, limit should not 
be expected because if we were to integrate out the top quark, we would no longer 
have a renormalizable theory — the remaining low-energy theory would have an 
incomplete weak doublet. Early contributions to this subject appear in [Ve 77b] 
and [ChFH 78]. As noted above, if both members of the doublet are taken to be 
equally heavy (m, =m, — œ), there would exist no quadratic dependence on the 
heavy-quark mass, and the decoupling theorem (cf. Sect. [V—2) would be satisfied. 
It is the large splitting in the weak doublet which leads to the observable violation 
of decoupling. 

Even though two different renormalization schemes must give the same final set 
of results, intermediate details will generally differ. For example, the leading m, 
behaviors for the coefficients pp and « ¢ of Eq. (1.5) are [RPP 12], 


2 


c 
on-shell: pr ~ 1+ (4p) +--+- ep~ 14 Ap t 


MS): p~li+-- helei (6.7) 
where (Ap), is defined in Eq. (6.3).!! 


11 The case f =b is special; the leading behaviors are ôf ~ 1 — 4(Ap);/3 and êf ~ 1 + 2(Ap)1/3. 
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Fig. XVI-4 (a)-(b) Vertex, (c) propagator, and (d) mass-shift counterterm cor- 
rections to muon decay. 


The O(G,m?) contribution to Ar 


The quantity Ar describes the effect of electroweak corrections on the leading 
order relation which defines the muon decay constant. In particular, the tree-level 
relation of Eq. (II-3.43) becomes modified by the radiative corrections of 
Fig. XVI-4 [Si 80, BuJ 89], 


G > G 2 
m0 £20 N noZ & 


J2 T 8M2, Vi sm} 
It is to be understood in Eq. (6.8a) that ‘G,’ is determined from the muon lifetime 
with the photonic corrections described in Sect. V—2 already taken into account. 
Thus, Ar contains only the remaining electroweak effects. 
To trace the origin of the quantum correction, we observe first the effect of the 
WŒ self-energy on the bare relation in Eq. (6.8a), 


Gu 80 1 8o l Aww (0) 


V2 8 g?—Miyy + Aw) 8M0 Mi, 


[1+ Ar]. (6.8a) 


+ (6.8b) 


where we have taken q? ~ 0. Next, we replace the bare parameters 82o and M wo 
by their physical forms as in Eq. (4.12). Comparison with Eq. (6.8a) directly yields 


Ar : (6.9) 
My 83 
Upon using Eq. (4.14) for 585, we can rewrite Eq. (6.9) as 
ôe? c? [8M} 8M? Aww(0) — ôM? 
Ar mn e _ Cy Z _ v ( ) - WwW (6.10) 
e s? LM} My My 


Recalling that the W* and Z° mass shifts can be related to the self-energy functions 
Aww (Mj) and Au(M3), it should be clear that Eq. (6.9) expresses Ar entirely 
in terms of calculable quantities.'? Although each of the terms in Eq. (6.10) is 


12 There are additional radiative corrections, such as the ‘box’ diagrams, which we shall not discuss. 
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divergent, the overall combination is finite. A number of rearrangements and alge- 
braic steps can be used to isolate the leading contributions, and one finds 


Ar = Aa + Ary + (AP) rem, (6.11) 
where 
a(M3)-—a A 2 
Aa = du E > ÎI (M3), and Arw = -KAj (6.12) 
a 


Ap is given by Eqs. (6.5)-(6.6), and (Ar);em contains smaller finite contributions. 

The largest contribution to Ar is Aq, the shift in the fine-structure constant. 
Although we have previously expressed the variation in œ(q°) in terms of fermion 
masses (cf. Eq. (II-1.38)), the difficulty in precisely determining quark masses 
would appear to undermine an accurate evaluation of Aœ. However, one can use 
dispersion relations to relate the hadronic contribution to the vacuum polarization, 
Mhaa(q2). directly to cross-section data. Recalling Prob. V—2, we have 


iM qD = ie? l d*x e'T™ (0T (J£ (x) Jz, (0))10) 


= (q"q" — q’g"”) Tna (4°). (6.13) 


The imaginary part of Mpaa (°) is expressible in terms of cross-section data evalu- 
ated at invariant energy q’, 


; e had 
In Tha (2) = 2R(@2) wia Re) = LES Oy 


Thus, we obtain a dispersion relation for the subtracted quantity Eiai q”), 


Mag (q) = ae *) — Mnaa(0) 


| “Ja O RO 
T +f (6.15) 
4m2} SO s(s — q? — ie) = 47 J ie)’ 


where sọ denotes the point at which data become unavailable. For energies 
above so, a perturbative representation is used to approximate R(s). The result of 
Eq. (6.15), when added to the lepton contributions, implies a value for a~!(M 2) 
[DaHMZ 11], 


aT! (MZ) = 128.952 + 0.014. (6.16) 


Some feeling for the magnitudes of ‘Ar’ corrections is given in the following 
(the numerical values have been taken from [KuMMS 13]): 


13 There are minor differences in various evaluations cited in the literature, depending on how the perturbative 
estimate is performed or on the particular renormalization scheme. 
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Fig. XVI-5 Top-quark corrections to the Z°bb vertex. 


TO 


Ar=1- = 0.0350(9), 
TA 
Anai f = 0.0699 (7)(4), 
/2G,,M},S2(Mz) 
Ap = ai = 0.0598(4). (6.17) 


=1— = = 
V2G.M3C2(Mz)S2(Mz) 


The above relations, although exact at tree level (the ‘Ar =0 limit), lead to the 
different values shown away from this limit. As before in this chapter, the quantities 
se (Mz) and & (Mz) are defined in MS renormalization and evaluated at scale Mz. 
In order to obtain the above form for Ar, we have replaced [1 + Ar] in Eq. (6.8a) 
by 1/[1 — Ar], which is valid in our first-order analysis. 


The Z — bb vertex correction 


The preceding analyses of Ap and Ar could very well be carried out for any other 
electroweak observable. In most cases, we would again find important O(G „m?) 
radiative corrections. Thus, for example, the Z° width for decay into lepton £ 
(€ =e, u, T) has the form 


Doe =r gL + (Ap) +--+], (6.18) 
and grows quadratically with increasing m, [AKBYR 86]. The origin of this effect, 
the one-loop tz contribution to the Z? propagator, is identical to that discussed 
earlier. 

Interestingly, however, a more complete calculation reveals a slight decrease to 
occur in the decay rate Izo, as m, grows. This is because, although the decay 
amplitude contains a (universal) propagator contribution proportional to (Ap);, an 
even larger effect, the vertex correction of Fig. XVI-5, contributes with opposite 
sign [AkBYR 86, DjKZ 90],'* 


14 Due to cancelations, the vertex correction turns out not to affect asymmetry phenomena, such as the b-quark 
(b) 
forward—backward asymmetry A pp- 
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Tasis ia f + 55 CEA +> | , (6.19) 


where the Z°bb vertex correction is given by 


20 130 
(Av), = -F ey — on 


— 6.20 
57x M3 ee 


The dd, ss modes also contain virtual t-quark vertex corrections, but they are 
greatly suppressed by the tiny accompanying CKM factors |V4|? (i =d, s). Recall- 
ing the characterization given in Sect. XVI-1 of radiative corrections as either ‘uni- 
versal’ or ‘nonuniversal’, one may interpet the Z°bb effect as a nonuniversal term 
which contributes as 


4 8 1 2 
(Ap)? = —2(Ak) = —-Ap— “( + Jaz . (6.21) 


t 
nonuniv nonuniv 3 
3 M2, 


Although O(m?) corrections are the most important, ©(ln(m?/ M3) logarithmic 
dependence has been included in Eq. (6.20) because it has a nonnegligible numer- 
ical impact. 


Precision tests and New Physics 


In precision electroweak tests, about 20 (mainly W* or Z°) observables are fit 
to Standard Model predictions (e.g. see [Ba et al. (Gfitter group) 12, RPP 12]). 
Such tests are based on the availability of high-quality data (with precision at the 
1% level or better), multi-loop theoretical Standard Model predictions, and sophis- 
ticated software packages.!> In view of the LHC discovery regarding the Higgs 
boson, the list of detected Standard Model particles is now complete. Consequently, 
there will be, more than ever, an emphasis on using precision tests to probe contri- 
butions from beyond the Standard Model. 

As was noted ever since the first electroweak corrections were calculated (e.g. 
[Ve 77a]), physics associated with a large-energy scale A should affect the gauge- 
boson self-energies —i TI} (q4) (i=yy, yz, WW, ZZ). In particular, the —i T}, (q) 


Hv 
could contain loop corrections (sometimes referred to as oblique corrections) from 


15 Let us describe just a few of these. The Zfitter collaboration, begun in 1985 ([AkARR 13]), established a 
FORTRAN library of Standard Model predictions for ete~ —> f f(+y’s) at energies ,/s =20 > 150 GeV 
using the on-shell renormalization scheme. The LEP electroweak working group LEPEWWG was founded 
in 1993 to perform fits of LEP and Tevatron data, particularly of Z-pole observables such as the effective 
weak mixing angle 5} of Eqs. (2.2b),(2.14), using Zfitter in part as input. A more recent effort using on-shell 
renormalization to perform electroweak global fits is the Gfitter group. The Global Analysis of Particle 
Properties (GAPP) software is employed by the Particle Data Group [Er 00]. This is a special purpose 
FORTRAN package, which performs calculations and fitting procedures and utilizes MS renormalization, 
Finally, the Heavy Flavor Averaging Group provides updates to world averages of heavy-flavor quantities. 
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‘new’ particles. For A? >> q’, one would expect rapid convergence of an expansion 
for —i TI} (q) in powers of q?/A?’, yielding the following effective low-energy 


uv 
description, 

-iTi (d) = Suv (Ai + gA) +. (6.22) 

This description involves eight free parameters, A,,,..., A’,,. However, the condi- 


tions IT,,,,(0) = I,,,(0) =0 reduce this number to six. An additional three parame- 
ters can be absorbed into the renormalization of a, G,,, Mz, which experience the 
shifts [BaFGH 90], 


Say Gu Ave. ôM} Ax 


’ = m z A, 
a me "Ge M3 M3 


zz" 


(6.23) 


The three remaining parameters may be chosen to be quantities known as S, T, U 
and defined as [PeT 90] (we employ MS renormalization here [RPP 12]) 


â (Mz) < _ k (Mz) - ARPO SAGO A 2] 


45323 M3 so M3 M3 
AN 0) ASO) 
amor =| aa = |: 
My M; 
(M2) es q yy = | Ate (Miv) -AWO _ ez APO _ AYO) 
453 7 My ŝz Mz M; | 


(6.24) 


where the superscript (NP) refers to contributions only from New Physics. Clearly, 
these S, T, U parameters are defined so as to vanish in the limit of only Stan- 
dard Model physics. If nonzero, they would appear as new contributions to various 
observables, e.g., 


2 
a 1 — &(Mz)T 
MEM 1 — G MY? S/QV2r) 


ue i 7 1 
My | 1- G MẸ (S + U)/QV2x) 


3 

us perro 1 

L Dl? on = 7 ’ (6.25) 
M, E i 1 — &(Mz)T 


all of which compare the experimental value with the Standard Model (SM) predic- 
tion. In this way, bounds are placed on the New Physics parameters and the results 
found in [RPP 12] are!6 


16 The range of Higgs-boson masses 115.5 < My (GeV) < 127 was used as input. 
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0.11 0.11 
S = 000") as PS 00275 U = 0.08 + 0.11, (6.26) 


and are consistent with Standard Model expectations. At present, the precision 
electroweak fits do not yet display evidence for effects beyond those predicted by 
the Standard Model. 

The literature contains several other possible New Physics parameterizations. 
For example, if the scale of New Physics is not much larger than the Standard 
Model weak scale, then parameters X, Y, V, W will, in principle, contribute to the 
fitting procedure [BuGKLM 94, BaPRS 04]. However, their determination requires 
data at energies higher than the scale set by the Z-boson mass and so, e.g., in the 
work of [Ba et al. (Gfitter group) 12], the quantities X, Y, V, W are set equal to 
Zero. 

As we have emphasized throughout this book (e.g. Sect. [V—9), the effects of 
heavy particles can be analyzed theoretically by using effective lagrangians and the 
preceding analysis can be expressed naturally in this language (e.g. see [Sk 10]). 
These must respect the SU (2); x U(1)y gauge symmetry, but may or may not 
include the extra custodial SU (2); x SU (2)r invariance of the Higgs sector with 
doublet Higgs fields. There will be a tower of such operators, beginning with those 
of dimension-six. However, not all dimension-six operators are relevant to elec- 
troweak phenomenology. Examples of these are (H'H ‘a and H'HD, H'D“H, 
the point being that processes having Higgs bosons as external states are presently 
experimentally inaccessible. Instead, we consider the two operators.!” 


Os = Ho; HFi,BY, Or =|H'D,H|’, (6.27) 


where BY’, F A are, respectively, the field strength tensors defined in Eqs. (II- 
3.11), (II-3.12). These operators, together with the usual Standard Model 
lagrangian Lsm, can be added together to form 


L = Lsm + asOs + arOr. (6.28) 


The New Physics coefficients as and ar will each carry units of inverse squared- 
energy and the Higgs fields in Eq. (6.27) will each contribute a factor of the 
symmetry-breaking energy v, so that in this approach the S$, T parameters will 
obey 


S xas v?, T xar i (6.29) 


and we leave evaluation of the proportionality factors to an exercise at the end of 
the chapter. Although we do not survey New Physics models in this book, a large 
variety is discussed in [Sk 10, RPP 12, Ba et al. 12], among others. 


17 Tt turns out that associated with the parameter U will be a dimension-eight operator. 
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Problems 


(1) Tree-level coefficients in effective lagrangians 

(a) Using the simplest quark—parton description of protons and neutrons as 
uud and ddu composites, reproduce the content of the tree-level expres- 
sions of Eq. (1.7) by determining the quantities R, o and R5 o for scattering 
from an isoscalar target. It might be helpful to first refer to a summary of 
parton phenomenology, e.g., as in [RPP 12], for guidance. Suppose a neu- 
trino deep-inelastic experiment reports R, =0.3072 + 0.0032. Infer from 
this a central value and an error estimate for the tree-level quantity ee 


(b 


wm 


Likewise, reproduce the tree-level expressions for the coefficients C ad of 
Eq. (1.11), and infer a value for <4 assuming the value Qw = —69.4 + 
1.55 + 3.8 for the weak nuclear charge. 

(2) Power-law radiative corrections 

(a) Verify the statement that if m,=m, — oo, there is no quadratic mass 
dependence in the calculation of Ap. 

(b) From the combination of Dirac matrices appearing in Eq. (5.4), it is evident 
that the self-energy amplitude has a ‘left—left’ (LL) chiral structure. To see 
how this affects the result, repeat the analysis of Eqs. (5.4), (5.5) except 
now employing a ‘left-right’ (L R) chiral structure. 

(c) By averaging the LL and LR self-energies and passing to the limit mı = 
m —> m, reproduce the g%? part of the photon self-energy of Eq. (II-1.26). 

(d) If we had a purely left-handed U (1) theory, the vacuum polarization would 
grow with m? as m; — oo. How can this be consistent with the decoupling 
theorem? 

(3) Effective field theory and the S,7 parameters 
Determine the proportionality factors which were not provided in Eq. (6.29). 


Appendix A 


Functional integration 


In this appendix we outline the basis of functional methods which are employed 
in the text. Path-integral techniques appear at first sight to be rather formal and 
abstract. However, it is remarkable how easy it is to obtain practical information 
from them. Very often they add insight or new results, which are difficult to obtain 
from canonical quantization. 


A-1 Quantum-mechanical formalism 


Before attempting to address the full field-theoretic formalism we first review the 
application of such techniques within the more familiar setting of nonrelativistic 
quantum mechanics in one spatial dimension. Unless otherwise specified we here- 
after set h = 1. 


Path-integral propagator 
Simply stated, the functional integral is an alternative way of evaluating the 
quantity 


—iH(tf—ti 


D(xf, tps xi, ti) = (xple lxi) = (xp, ty|xi, ti). (1.1) 


This matrix element, usually called the propagator, is the amplitude for a particle 
located at position x; and time t; to be found at position x ¢ and subsequent time t p. 
The propagator can also be written as a functional integral 


D(xf, tf; Xi, ti) = l Dixe°"™, (1.2) 


where the integration is over all histories (i.e. paths) of the system which begin at 
spacetime point x;, t; and end at xp, ty. The paths are identified by specifying the 
coordinate x at each intermediate time ż, so that the symbol f D [x(t)] represents 
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a sum over all such trajectories. The contribution of each path to the integral is 
weighted by the exponential involving the classical action 


tf m ð 
Six] =| dt (Fs (t)- V(x), (1.3) 


which, since it depends on the detailed shape of x(t), is a functional of the trajec- 
tory.! Although the validity of the path-integral representation, Eq. (1.2), may not 
be obvious, its correctness can be verified by beginning with Eq. (1.1) and breaking 
the time interval ty — t; into N discrete steps of size € = (tf — t;)/N. Using the 
completeness relation 


CO 

1= J aX, Xn) (Xn, 
=00 

one can write Eq. (1.1) as 


oe oo 
Depia = | diwai f dx, 


—oo oO 


eH ape zieh 
(xnle "t? |xn—1) (xne xn) ++ > ale 


Ixo), (1.4) 


where x9 = x;, xy = Xf. In the limit of large N the time slices become infinitesi- 
mal, implying 


(xele. xe) = (xele (s 
. : p2 
_ eV OW) (ype iT [x p_1) + Ole”), (1:5) 


Inserting a complete set of momentum states and introducing a convergence factor 
2 . : 
e “?’ for the resulting integral over momentum, we have 


2 
(xele! xei) = lim f 2E ere)? 
k>0 zoo An 


m ¿m n 2 
= | See e! ze (xe-40-1) i (1.6) 


l Tris important to understand the difference between the concept of a function and that of a functional. 
A real-valued function involves the mapping from the space of real numbers onto themselves 


reals <— [f : reals]. 


On the other hand, a real-valued functional such as S [x(t)] is a mapping from the space of functions x (t) 
onto real numbers 


reals <— [S: x(t)]. 
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Upon taking the continuum limit we obtain 


D (xf, tf; Xi, ti) 


N-1 
l m \? a 
= lim ( - ) I] dx, |e 
Noo \27i€ | 00 
n= 


It is clear then that we can make connection with Eq. (1.2) by identifying each path 
with the sequence of locations (x1, ..., xy—1) at times €, 2€,..., (N —1)e. Integra- 
tion over these intermediate positions is what is meant by the symbol f D [x(t)], 
viz. 


2 
ix (: CD eve) 
À (1,7) 


m N2 po 
[rors jim, =) I / dar (1.8) 
n=1 "© 


iS[x(t) 


Each trajectory has an associated exponential factor e ! where the quantity 


Sm Ge = 0-1" 
Sx] = Die (Ga = van) (1.9) 
f=1 


becomes the classical action in the limit N —> oo. We have thus demonstrated 
the equivalence of the operator (Eq. (1.1)) and path-integral (Eq. (1.2)) representa- 
tions of the propagator.” It is important to realize that in the latter all quantities are 
classical — no operators are involved. 

The path-integral propagator contains a great deal of information, and there are 
a variety of techniques for extracting it. For example, the spatial wavefunctions 
and energies are all present, as can be seen by inserting a complete set of energy 
eigenstates {| n)} into the definition of the propagator given in Eq. (1.1), 


D(xy, ty 1,4) = Do rape er. (1.10) 


n=0 


2 For completeness, we note that by combining Eqs. (1.5)—(1.8), one can also write the propagator in a 
corresponding hamiltonian path-integral representation 


; dpo , api adpn-i 
D(xf,tf; x;, ti) = lim — dx; —dx - - - dxy 1 —— 
( FUP i) N00 20 l 20 2 N= 27 


2 
¿ Py 
iDo (pee-u-n-( 5 +vao]e) 
xe 


= [ DEDO iSi iHe, 


This form is useful when one is dealing with non-cartesian variables or with constrained systems. 
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In addition, other quantum-mechanical amplitudes can be found by use of the 
identity? 


(xf, tell lh) «+ x(G)) lx, ti) 


— f? [x(t)] x(t) E x(t) el A at(ZPO-VEO) (1.11) 


where ‘T’ is the time-ordered product. 


External sources 


An important technique involves the addition of an external source. In the quantum- 
mechanical case this is added like an arbitrary external ‘force’ j(t), 


. tf m +2 . 
(xp. teix tijo = / Duaje HF#O-VEO+ OO] (1.12) 


The amplitude is now a functional of the source j(¢). From this quantity one can 
obtain all matrix elements using functional differentiation, which can be defined 
by means of the relation 


Si) _ 


JH= farse -tjt = êlt— t’) (1.13) 
ôj (t') 
and yields the result we seek, 
(xf, tfl T (x(t) -+x (ta)) lxi, ti) 
an ô” 
= (—i)" ——— (xp, tf |xi, tholo (1.14) 


dj (ti)... 87 (tn) 
For many applications it is necessary only to consider matrix elements between 
the lowest energy states (vacuum) of the quantum system. This can be 


3 One can prove this relation by choosing a particular ordering, say 
ti <t <th <: <tf, 
and noting that 


(xf tfIT ty) ++ xCn)) xi, ti) = (xf, tf len) x (r—1) (ED) ti) 


n a 
= JI / dxk(xf,tf|Xn, tn) Xn (Xn, tn|Xn—1; tr—=1)Xn—1 +0 x1 (X1, til, Xi, ti), 
kale 


where we have used completeness and have defined xz = x (tg) (k = 1, 2, ... , n). The amplitudes 

(Xk, tk|Xkķk—1, tķ—1) are simply free propagators as in Eq. (1.1), and can be evaluated by means of the 
time-slice methods outlined above. Thus, the above expression is identical to the right-hand side of 

Eq. (1.11). In the case of a different time ordering the same result goes through provided one always places 
the times such that the later time always appears to the left of an earlier counterpart. However, this is simply 
the definition of the time-ordered product and hence the proof holds in general. 
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accomplished in either of two ways. First, it is possible to explicitly project out 
this amplitude using the ground-state wavefunction 


(x, t10) = yoxe E, (1.15) 


which implies 
(OIT (x(t1) +++ x(n) 10) =| ax; f dx; Wo (x5) e PF 


(xp, te|T (th) +++ x (tm)) Lai, ti) Wor) e "7". (1.16) 


However, this amplitude can be isolated in a simpler fashion. If we consider the 
amplitude (x, t¢|x;, ti) in the unphysical limit tf — —ity, ti > +i7; we find for 
large Tf + Ti, 


(xf, kel Ree ti) = 5 Wn (x) Ort (xi) erT 
n 


erat Wolx swe (x) eo Ort), (1.17) 
Generalizing, we have 
: ef Eott) 
J aan erat 


= (O[T (x (t1) ++ x(t) 10) (1.18) 


which is operationally a much simpler procedure than Eq. (1.16). 


The generating functional 
We may combine all these techniques in the so-called generating functional, 
defined by 
Zij] = im (xp tflXi ti) jo. (1.19) 
ae 
This has the path-integral representation 
Z[j]ļ= lim [rv ae dt (FPO-VE) +2010) (1.20) 
TEF 
Noting that for t; = it; and tf = —iTy, 


(xp, telix ti) > Poa pya) et —> ZO], (1.21) 


Ti, Tf Fw 


498 Functional integration 


we find that ground-state matrix elements as in Eq. (1.16) can be given in terms of 
the generating functional Z[/], 


1 ô” 
(OIT E) D O = D Fe aS 


Zihi (1.22) 

It often happens with path integrals that formal procedures are best defined, as 
above, by using the imaginary-time limits £ — -tioo. However, in practice it is 
common instead to express the theory in terms of Minkowski spacetime. Thus, 
the generating functional will involve the real-time limits £ — +00. Does the 
dominance of the ground-state contribution, as in Eq. (1.21), continue to hold? 
The answer is ‘yes’. At an intuitive level, one understands this as a consequence 
of the rapid variation of the phase e~/4" 
phase variation accompanying the increased energy E, of any excited state washes 
out its contribution relative to that of the ground state. In a more formal sense, 
the real-time limit is defined by an analytic continuation from imaginary time. To 


in the limit £ — oo. The more rapid 


properly define the continuation, one must introduce appropriate ‘ie’ factors into 
the Green’s functions in order to deal with various singularities. Beginning with the 
next section, we shall often employ the Minkowski formulation and thus explicitly 
display the ‘ie’ terms in our formulae. 

The prescription given in Eq. (1.22) represents a powerful but formal procedure 
for the generation of matrix elements in the presence of an arbitrary potential V (x). 
Unfortunately, an explicit evaluation is no more generally accessible via this route 
than is an exact solution of the Schrödinger equation. In practice, aside from an 
occasional special case, the only path integrals which can be performed exactly 
are those in quadratic form. However, approximation procedures are generally 
available. 

One of the most common of these is perturbation theory. Suppose that the full 
potential V(x) is the sum of two parts V;(x) and V(x), where V(x) is such that 
the generating functional can be evaluated exactly while V2(x) is in some sense 
small. Then we can write 


p tf m +2 fi ° 
Zi] — „im, Dix (t)]e' Ji dt[ Fx (t)-V (x(t)) VEMO O] 


ti >ioo 


= lim eh at y(i) zO 


aw (ee ô "Or 
a = J dt v( rl ZF], (1.23) 
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where 
ZH] = lim [ro cih dt[ 5 O- VEMO] (1.24) 
tfio 


is the generating functional for Vı(x) alone. Obviously, Eq. (1.23) defines an 
expansion for Z[j] in powers of the perturbing potential V2(x). 


A-2 The harmonic oscillator 


It is useful to interrupt our formal development by considering the harmonic oscil- 
lator as an example of these methods. This treatment turns out to reproduce known 
oscillator properties with the use of functional methods, which are very similar to 
corresponding field-theory techniques. 

It is most convenient to address the problem by employing Fourier transforms, 


x(t) = f = a 5 (E), (2.1) 


whereby for t; = —oo and tz = +00, 


S! [x(t)] =j 4 (3e Hak ne) +x) 


[0,0] 


dE PENET Potae l~ z 
a É (E? — w°)%(E)X(—E) + pl Ea Ey omw] 


Qn 
-| 5 P (E? — oi (E) (E) — —j(B) j(-E 
= “oo 2 p œ“ )x (E)x ) Im” E2 oœ $ ie” DE 
(2.2) 


with the definition ¥'(E) = x(£) + j(E)/(mE2 — mœ? + ie). An infintesimal 
imaginary part ie has been introduced to make the integration precise. Upon taking 
the inverse Fourier transform 


f i dE eet ~/ 1 = f A agat 
x= an? xX (E) = x(t) Po dt D(t —t)j(t), (2.3) 
=00 =00 
where 
D(t = rj = f dE niei 1 —=— i e tlt (2.4) 
-AIr E? — œ? + ic 2w 
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we have 
oo 2 
Sİ [x(t)] = J dt ( 42) — TP x2) 
ane 2 2 
1 [0,6] [0,0] 
-_ J ar | dt j(@t)D(t —t')j(r’). (2.5) 
2m —0oo =00 
Finally, changing variables from x(t) to x’(t) we obtain the generating functional 


4 if dt ( mH _ mo? 12 
A= (opel. sy) 
x e` Ta Soo tt f dt J ODE-H) jC’) 
= Z[0]e 2m o dt I JODU, (2.6) 


Note that the above change of variables has left the measure invariant ( fi D[x(t)] = 
SDO). 

We can use this result to calculate arbitrary oscillator matrix elements. Thus for 
t > tı, we have for the ground state 


1 Zj] 


(y) 
OIT Œ (t2)x(t1)) 10) = (~i) ZIO GDS o 


i eiet) 
= —D(h — t) = ——., (2.7) 
m 2ma@ 
which, in the limit t —> tı, reproduces the familiar result 
> 1 
(O|x“|0) = ——. (2.8) 
2ma@ 


Although only ground-state expectation values have been treated thus far, it is 
also possible to deal with arbitrary oscillator matrix elements with this formalism 
by generalizing the operator relation 


In) = —— (a")" [0) (2.9) 
where 
at = a (x — r) (2.10) 
2 mæ 


is the usual creation operator. First, however, it is convenient to use the classical 
relation p = mx to rewrite the operator a‘ as 


gs (1 af =) x(t). (2.11) 
w 
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In a simple application, we calculate that 


(O|x|1) = lim f= (1-+ j ) ouxcerx(anio 
hott 2 æ Of} 


; 9 [M@ io 1 8? : 
= lim (—i) 1— = Lil 
pee 2 w at, ) Z[0] Sj Cj t) 
A mæ i oð i 
= lim (1 = ) D(t = ti) 
port VY 2 @ Ot; / m 
= l (2.12) 
— Smo l 


which agrees with the result obtained by more conventional means, 


Ja 1 
(O|x|1) = pa VPS (2.13) 


More complicated matrix elements can also be found, as with 


(1|x7]1) = = lim (1 + =) (1 = =) (O|x(t1)x? (t)x (f)|0) 


w Oto 


+4 ; z 
=l ji E PAA E 
Z[0] tpt! tyott 2 (22) ati (2) Of 


54 
Ôj (t1)ôj (12) dj (157 (t) j=0 


mo (i? . i a i a 

= — | — lim {1+ — — l- — — 

2 m a w Ot; w ðh 
yor 


x [Dt — h)DO) + 2D, — 1) DG — h)] = eZ (2.14) 
2mw 


ZL] 


which agrees with 


3 


1 l ; 
(1x71) = zg (a +a’) (a +a’) ID = 5. (2.15) 


2mw@ 


In this manner, arbitrary oscillator matrix elements can be reduced to ground-state 
expectation values, which in turn can be determined from the generating functional 
Z[j]. The ground-state amplitude in the presence of an arbitrary source j (¢) con- 
tains all the information about the harmonic oscillator. 

One should note the analogy of the above methods to those of quantum field the- 
ory. The ‘one-particle’ matrix elements involving |1) have been reduced to vacuum 
matrix elements by use of Eq. (2.9). This is similar to the LSZ reduction of fields 
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(see App. B-3). As a result, all that one needs to deal with are the vacuum Green’s 
functions. The generating functional is ideal for this purpose, as we shall see in our 
development of functional techniques in field theory. 


A-3 Field-theoretic formalism 


One of the advantages of the functional approach to quantum mechanics is that it 
can be taken over with little difficulty to quantum field theory. An important differ- 
ence is that instead of trajectories x(t), which pick out a particular point in space 
at a given time, one must deal with fields g(x, t) which are defined at all points 
in space at a given time t. Also, instead of a sum f D[x(t)] over trajectories one 
has instead a sum f [dy(x)] over all possible field configurations. Nevertheless, 
the analogy is rather direct. 


Path integrals with fields 


The formal transition from quantum mechanics to field theory can be accomplished 
by dividing spacetime, both time and space, into a set of tiny four-dimensional 
cubes of volume ôt ôx ôy 6z. Within each cube one takes the field 

p (xi, Yj, Zk, te) 68.1) 


as a constant. Derivatives are defined in terms of differences between fields in 
neighboring blocks, e.g., 


1 
do| ~ — (9 (Xi, Yj, Zr + ôt) — @ (xi, Yj ze t)) 6D 


N 
Xi Vj Xk oll ôt 


The lagrangian is easily found, 


LP. 8nP) |e, y, en XE P (xis Yi, ze ti) » Bn (Xis Yj Zt), 6-3) 
and the action is written as 
S > bxdydzdt L (Y (xi, Yj» Zk ti) ; InP (xi, Yj» Zk, ti)) . (3.4) 
i, j,k,l 


The field-theory analog of the path integral can then be constructed by summing 
over all possible field values in each cell 


oo 
D~ I] |_% (a, Yj Zk tı) e! Sloi ykz, dup (xi yj zkt)] (3.5) 


i,j,k,” T 


Formally, in the limit in which the cell size is taken to zero, this is written as 


fiio ef TPO), dupl (3.6) 
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By analogy with the quantum mechanical case (cf. Eq. (1.18)), it is clear that, since 
the time integration for S in Eq. (3.4) is from —oo to +00, this amplitude is to be 
identified with the vacuum-to-vacuum amplitude of the field theory, 


(010) = N f idge) eeo, (3.7) 


Generally, quantum field theory is formulated in terms of vacuum expectation val- 
ues of time-ordered products of the fields 


G (x1, ..., Xn) = (OJT (1) P&r) 10) (3.8) 


i.e., the Green’s functions of the theory. By analogy with the quantum-mechanical 
case, one is naturally led to the path-integral definition 


GG esate) =N f doo p eO, (3.9) 


where N is a normalization factor. Again we emphasize that all quantities here are 
c numbers and no operators are involved. In terms of a functional representation, 
we then have from Eqs. (3.7), (3.9), 


SdP G01) + panei T e] 


G™(x1,..., Xn) = 
(xı xX ) f [dp(x)] 2! Sla) anp] 


(3.10) 


Generating functional with fields 


These Green’s functions can most easily be evaluated by use of the generating 
functional 


Z[j] = x f [do (x)] eG S19). 4n9)]+i f dx jp) (3.11) 
Functional differentiation for fields is defined by 
a = 3% ~ —y), (3.12) 
which lets us obtain (cf. Eq. (3.9)) 
5” 


1 
(n) SN ee > 
G (x1, ...5Xn) = (-1) ZiO Boa) yG P (3.13) 


As an example of this formalism consider the free scalar field theory 


m2 


1 
LOX) = zupy — ae (3.14) 
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In general, we have 


[ee] i” n io) 
Z[j1 = Z10) = i J dx jav] GO O taessa GAS) 
n=0 ` k=1“ 7% 


where the generating functional Z®[j] is given by 


m 


Z0) [i] = n f [dy(x)] gil dah zono o- ti ojo) (3.16) 


There exist two common ways in which to handle the issue of convergence for 
such functional integrals, i.e., to ensure acceptable behavior for large vy”. One 
is to give the mass an infinitesimal negative imaginary part, m? —> m? — ie. 
This is the approach we shall employ in the discussion to follow. The second 
involves a continuation to euclidean space by means of t — —it wherein the 


functional integral becomes 
= fidis t m2 2 
(0|0) = N fidoo fd xe (53.08 .0+ Te ) (3.17) 


and is now convergent due to the negative argument of the exponential. Continua- 
tion back to Minkowski space then yields the desired result. 
Integrating by parts, we have from Eq. (3.16) 


Zr j] = vf [dg] et! Dewo] 


— vf [ag] en lS atx g(x) Oxo" (x)+f d4x fdty jaAra-y)i(y)] (3.18) 
where O, = O, + m? — ie and 


gompta / the TG), 


Ax = Pk ika- 
ee ae eos k2 — m2? + ie’ 
(O; +m Ar — y) = 8 (x — y). (3.19) 


Note that we have used invariance of the measure ( f [d y ] =f [dg' J- Finally, we 
recognize a factor of Z [0] in Eq. (3.18), thus leading to the expression 


ZT] Z ZOT S Sy I@MAFE-YIO), (3.20) 
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We can now determine the Green’s functions for the free field theory, e.g., 
(=i)? 82 
Z[O] 8j 1) 8j (X2) 


G (x1, x2) = ZOU) =iArQy— 2), 


j=0 


(=i) 54 
ZOLO] 8j (x1)8j Œ2)8j (x3)8j (x4) j=0 

= G (x1, x2)G (x3, x4) + GO (x1, x3)GP (x2, x4) 

+ GË (x1, x4)G© (x2, x3). (3.21) 


G (x1, x2, X3, x4) = ZF) 


More interesting is the case of a self-interacting field theory for which the 
lagrangian becomes 
1 
2 
The theory is no longer exactly soluble, but one can find a perturbative solution by 
use of the generating functional 


1 
L(x) = =3 p3" p — 5m + Lin (yg) = L(g) + Lint). (3.22) 


Zj] =N J [do(x)] ef fats EOOH) 


z Nil es Lin(-ixty) zO [j]. (3.23) 


As before, the Green’s functions of the theory are given by 


1 “, —id i f d4x Lim (-i gis) 
GO (0.5 Xn) = elira aO o (3.24) 
Z[0] I] bj (Xx) j=0 
For most purposes one requires only the connected portions of the Green’s 
function, i.e., those diagrams which cannot be broken into two or more disjoint 
pieces. This is illustrated in Fig. A—1 which can be found by dividing the full 
Green’s function 


G (x1, ...,%n) = (OIT (a1) -+ P%n))10) (3.25) 


into products of connected particle sectors and dividing by the vacuum-to-vacuum 
amplitude (0|0) in each sector. 
Mathematically, one eliminates the disconnected diagrams by defining 


Zi) = ei, (3.26) 


Then one can show that W[/] is the generating functional for connected Green’s 
functions, 


oO i” o0 oo 
wil= >= | dau: f die Ii) Ja eas G27) 
n=0 `Y% =N 
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X & |p X 


(a) (b) (c) 


Fig. A-1 Contributions to the four-point Green’s function in gt theory: (a)—(b) 
connected, (c)—(d) disconnected. 


where 


ô” 
Ge isi SI — — Wj]. (3.28) 
ôj (x1) pee ôj (Xn) j=0 


A-4 Quadratic forms 


The most important example of a soluble path integral is one that is quadratic in 
the fields because, at least formally, it can be solved exactly. 
Let us consider an action quadratic in the fields, 


S= -f dx p(x) O(x), (4.1) 


where O is some differential operator which may contain fields distinct from ø 
within it. The general result for the quadratic path integral is given by 


Üi = Jipoi" POW — N[det Ol, (4.2) 


where det O is the determinant of the operator O. In order to prove this, one can 
expand g(x) in terms of eigenfunctions of O, 


(x) = Y anGn(x), (4.3) 
where @, (x) satisfies 


Og, (X) = An@n(x) and J d*x Qn (X)Pm(X) = nm. (4.4) 


The sum over all field values can then be performed by summing over all values of 
the expansion coefficients an, 


CO 
Luna N T dan! emi S x ER ange) E ape) 
n =09 


= ajf da, e = N' (det 0)", (4.5) 
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where N, N’ are normalization constants and 
[0,6] 

det O = I] he (4.6) 
n=l 


denotes, as usual, the product of operator eigenvalues. 

In general, some effort is required to evaluate the determinant of an operator. One 
valuable relation, easily proven for finite dimensional matrices and generalizable 
to infinite dimensional ones is* 


det O = exp(tr In O). (4.7) 


This trace now denotes a summation over spacetime points, i.e., 
tind = f d'x (x 1in0 |), (4.8) 


which is the most commonly used form in practice. 


Background field method to one loop 


We can illustrate one use of this result by constructing an expansion about a back- 
ground field configuration (which satisfies the classical equation of motion) and 
retaining the quantum fluctuations up to quadratic order. Consider a scalar field 
theory with interaction Lint ((x)). We define ø as a solution to 


(O + m°) (x) — Ling (PE) = J). (4.9) 
Writing 
g(x) = P(x) + d(x), (4.10) 


leads to the generating functional 


ZIJ] = eC SPO S dx jG) 
| [ddq] ef S #4 (dudy Sp- (M-L) 4. (4.11) 
where 
- ra i ee m? > - 
S[o&Œ)] = Ja x z Pu P(X)O" G(x) =a (x) + Lint (P(X)) } - (4.12) 


4 For a discrete basis, this follows from the result 


exp(tr InO) = exp) Indy = I] exp(ln àn) = JI An = det O, 
n n 


n 


where Àn are the eigenvalues of the operator O. 
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Integration by parts gives 
ZLj] = eC SPON S dx pwj) [tase e73 S Ax spa) Orpa) (4.13) 
where 
O, = O; +m —£i,(@(x)). (4.14) 
The functional integration can then be performed (cf. Eq. (4.5)) and we obtain 
ZĻj] = const. (det 0,7? eC SPO S dx jG) (4.15) 
It is convenient to normalize the determinant somewhat differently by defining 
Oor =O, +m’. (4.16) 


Then, suppressing the x subscript, we write 


(det O)~'/? = const. (det 0710)", (4.17) 
where 
const. = (det Oo)~!/7, (4.18) 
and 
O'O =14+ Arli (o). (4.19) 
Using Eq. (4.2) we have 
ZU] = Nel SPOR S dx jO- Te m+" D) (4.20) 


The generating functional for connected diagrams can now be identified 
immediately as 


Wij] = S [Ø] + J d*x j(x)@(x) + st In(1+ ArLi, @) 
1 1 
= [as [3080 + Lint (P) — zP) Lin @| 
+ sit In (1 + ApL”,()).- (4.21) 


The trace ‘Tr’ includes the integration over spacetime variables and can be 
interpreted as follows, 
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-i 


Tr In [1 + ArLi, (@)] = Tr s 


n=1 


Tr [Ar La (@)] = Je x Ar — x)Lin (P), 


Ar Lin @))’ 


Tr [ArLiy @) AL O] = f dx f d'y Ara = e GO) 
x AFO — x)Li, (G(x). (4.22) 


In this manner, one-loop diagrams containing arbitrary numbers of £; (9) factors 
are generated. The physics associated with this approximation can be gleaned from 
counting arguments. The overall power of fi attached to a particular diagram can 
be found by noting that associated with a propagator and a vertex are the powers h 
and A`}, respectively. There is also an overall factor of A for each diagram. Then 
with the relation 


no. internal lines — no. internal vertices = no. loops — 1 


we see that this approximation corresponds to an expansion to one loop. The clas- 
sical phase generates the tree diagram (O(h’)) contribution and the determinant 
yields the one-loop (O(h')) correction to a given amplitude. 


A-5 Fermion field theory 


Thus far, our development has been performed within the simple context of scalar 
fields. It is important also to consider the case of fermion fields where the require- 
ments of antisymmetry impose interesting modifications on functional integra- 
tion techniques. The key to the treatment of anticommuting fields is the use of 
Grassmann variables. Thus, while ordinary c-number quantities (hereafter denoted 
by roman letters a, b, .. .) commute with one another, 


[a,a] = [a, b] = [a,c] =--- = 0, (5.1) 


the Grassmann numbers (hereafter denoted by Greek letters a, B,...) 
anticommute, 


{a, a} = {a, B} = fa, y} = -= 0. (5.2) 


It follows that the square of a Grassmann quantity must vanish, 


a =B =y? =-= 0, (5.3) 
and that any function must have the general expansion 


f(a) = fot fia, g(a, P) = go + 10 + &2P + g308. (5.4) 
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Differentiation is defined correspondingly via 


da 
Pe, et a 
so that in the notation of Eq. (5.4) we have 
T a) = fi, ie B) = 8&2 — g3a. (5.6) 
Second derivatives then have the property 
@ 
Jada ~ 0. (5.7) 


We must also define the concept of Grassmann integration. If we demand that 
integration have the property of translation invariance 


fa fla) = fa f(@+t+ B), (5.8) 
it follows that 
fas fiß =0 or fa = 0. (5.9) 


The normalization in the diagonal integral can be chosen for convenience, 


[dea=t, [ator (5.10) 
Let us extend this formalism to a matrix notation by considering the discrete 


sets a = {aj,...,Q@,} and & = {a,...,a@,} Of Grassmann variables. A class of 
integrals which commonly arises in a functional framework is 


Z[M] = [dan aa dan ---dayeM”, (5.11) 
As an example, the simple 2 x 2 case is calculated to be 
Z[M] = J da dà, day da, [1 + i&;M;jæj 
+ã% (M11 M2 — Mj2Mp1) |. (5.12) 
Only the final term survives the integration, and we obtain 


Z[M] = det M. (5.13) 
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This result generalizes to the n x n system [Le 82] yielding essentially the inverse 
of the result found for Bose fields, 


ZIM] Femi = J dan- - dăı dary ++ daye'*"* = det M, 
Z[M]Bose = faa; -- -daž da,---daye~* ™* « (det M)! . (5.14) 


We can now extend this formalism to the case of fermion fields w(x) and Y (x). 
Since such quantities always enter the lagrangian quadratically, the functional 
integral can be performed exactly to yield 


Z[O] = f iay [dy] i! 700w — N det O. (5.15) 


The remaining development proceeds parallel to that given for scalar fields. Given 
the free field lagrangian 


Lo (Y, Y) = W(x) G — m) Yx), (5.16) 


the generating functional for the noninteracting spin one-half field becomes 
Z[n, ñ] = I [dy] [dy] ei LEER OkO HOOH n] (5.17) 


where O, = if, — m + ie and n(x), n(x) are Grassmann fields. Introducing the 
change of variables 


Persya- J OSES. HO), 


W(x) = W(x) - J d*y n(y)Sr(y, x), 

d*k —ik-(x—y) i 

(27)t k—m-+ie 

(if, — m)Sr(x — y) =5(x — y), (5.18) 


iSF(x—y)= 


we find that an alternative form for the generating functional is 
Zima [ (av lay E Ow oes Tesaaisreone, 
= Z[0, Ojei S E SAY BOS yO), (5.19) 


Thus, the generating functional for connected diagrams is 


Win, 7] = - f ats f ay (x) Sr x, yN), (5.20) 
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and the only nonvanishing connected Green’s function is 
&W 
ên (x2)ôN (x1) 
dtk —ik-(x1—x2) 
Qn)" k —m +ie’ 


G2) (%1, x2) = (-i)?? 


i 


=> Sr(x1, X2) = (5.21) 


which is the usual Feynman propagator. 


A-6 Gauge theories 


For our final topic, we examine gauge theories within a functional framework. We 
shall employ QED as the archetypical example, for which the action is 


1 1 
S[Ay] = -3 fats Fa F” = 5 fats (A, 0, A" — A „3t 3r Ay) 
1 
=5 J d*x A OH A,, (6.1) 
where the second line follows from the first by an integration by parts and 


o» =e" Dy, — 3% 3. (6.2) 
In the presence of a source j,,, the generating functional is then 
Zliul =N I [dA ]e SAHS dx jA, (6.3) 
Due to the bilinear form of Eq. (6.1), it would appear that one could perform the 
functional integration as usual, resulting in 
ZLiu] = ZOET S EAS Y J OD rw EDO, (6.4) 
where the inverse operator Dr, (x, y) is defined as 
O% Druy(x, y) = SSO — y). (6.5) 


However, this is illusory since the inverse does not exist. That is, acting on Eq. (6.5) 
from the left with the derivative 0;° yields 


Ox Dryv(x, y) = 076 (x — y), (6.6) 


implying that D,,,, must be infinite. An alternative way to demonstrate that O#” 
is a singular operator is to observe that 


Ov” d*a = 0. (6.7) 


Thus, any four-gradient 0*q@ is an eigenfunction of O#” having eigenvalue zero, 
and an operator having zero eigenvalues does not possess an inverse. 
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Gauge fixing 
The occurrence of such a divergence in the generating functional of a gauge theory 


can be traced to gauge invariance. For QED, any gauge transformation of vector 
potentials (cf. Eq. (II-1.3)), 


1 
Ap (x) > A (œx) = A (x) + zala), (6.8) 
leaves the action invariant, 
SIAL œ] = S[A L &)]. (6.9) 


If we partition the full field integration [dA,,] into a component [d A u] that includes 
only those configurations which are not related by a gauge transformation and a 
component [da] that denotes all possible gauge transformations, then we have 


J [dA] is^] = i [dA,,] eiA x J [da]. (6.10) 


But f [da] is clearly infinite and this is the origin of the problem. The solution, first 
given by Faddeev and Popov [FaP 67], involves finding a procedure which some- 
how isolates the integration over the distinctly different vector potentials A u(X). 
In order to understand this technique, we shall first examine a finite-dimensional 
analog [Ra 89]. 

Consider the functional 


N [0,0] 
Z[A] = p | ax | e7 Eri AU (6.11) 
isl’ © 


where A is an N x N matrix. Suppose that A is brought into diagonal form AP by 
linear transformation R, 


AP = RAR’. (6.12) 


Letting y = Rx denote the coordinates in the diagonal basis, we have 


N N 
Z[A] = fa eT EKARI — Tf eT Lue Ate 
i=l i=1 


yl 
= I] (5) = nl? idet A]. (6.13) 


Suppose that the last n of the N eigenvalues belonging to A vanish. The exponential 
factor in Eq. (6.13) is then independent of the coordinates yy_n41,..., yy and the 
corresponding integrations f dyy—n41...f dyn diverge. This is reflected in the 
vanishing of det A, and causes the quantities in Eq. (6.13) to diverge. The infinity 
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is removed if the integration is restricted to only variables associated with nonzero 
eigenvalues, in which case we obtain the finite result 


N-n 


I] J J eT Dk RAL (6.14) 
i=l 


It is possible to express Z/[A] as an integral over the full range of indices 1 < 
i < N by defining variables 


ZITA] — l 


Yi UsteN =n) 
= (6.15) 
arbitrary (N-—n+1<i<WN), 
and writing for the generating functional 
N 
Z![A] = T J ta B(en—npt) Swe TAHA (6.16) 
i=1 


Upon tranforming back to an arbitrary set of coordinates {x;}, we obtain the useful 
expression 


N 
ZITA] = TI fas det 
i=l 


Let us now return to the subject of gauge fields, broadening the scope of our 
discussion to include even nonabelian gauge theories. By analogy, corresponding 
to the variables zy_n41,..., zy Will be the gauge degrees of freedom and the pre- 
scription of Faddeev and Popov becomes for generic gauge fields A‘ (x), 


N 
[| 8 (Gy) eo msm, 6.17) 


j=N—n+1 


əz 
ax 


Zí = IT fan] S (Gp (A$ )) det |5Gy/daq| eid, (6.18) 
a b=1 


where the {œa} are gauge-transformation parameters (cf. Sect. IL-4) and the 
{ Gp (AR) are functions which vanish for some value of A; (x). Since the {Gp} 
serve to define the gauge, such contributions to the generating functional are 
referred to as gauge-fixing terms. The variation 6G,/da, signifies the response 
of the gauge-fixing function G, to a gauge-transformation parameter a,. 

For any gauge theory, there are a variety of choices possible for the gauge-fixing 
function G. In QED, one defines the axial gauge by 


G(A,) =n, A", (6.19) 


where n, is an arbitrary spacelike four-vector. Due to the presence of the four- 
vector n“, one must forgo manifestly covariant Feynman rules in this approach. 
Thus, one often employs a covariant gauge-fixing condition such as 
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G(A,) = 0° A, =F, (6.20) 


where F is an arbitrary constant. Under the gauge transformation of Eq. (6.8), we 
find 


G(A,,) > G(A,) + Da, (6.21) 
so that 
6G/sa = O. (6.22) 


Referring back to the general formula of Eq. (6.18), we see in this case that 
det |5G/da| is independent of the gauge field and thus may be dropped from the 
functional integral. The QED generating functional then becomes 


Aneel | [dA] 6(0"A,, — F) es (GADA 3 Ayah 82 Avt jA") 


=N [ldapel” (Au Ox AM +juA“) 
— Z[Oe72/ d*x f dy ODP Oy) jo), (6.23) 


Note that, as promised, this result is finite and leads to a photon propagator in 
Feynman gauge 

i & ZL jul = 
ZO] Eh OE) Je? 
The result is independent of the choice of F. Consequently, even if the constant F is 
evaluated to the status of a field F (x), one can functionally integrate over F (x) with 


an arbitrary weighting factor since this will only affect the overall normalization of 
the generating functional. A common choice is 


4 2 
dq emit ay) g” 


D(x, y) = . (6.24 


/ [dF] 6(0"A, — Fixe BLE PO c eh Sas OPAL) (6.25) 


where is a real-valued parameter. In this case, the generating functional becomes 


Ay (Og — aa”) Ay— xe (84 Ap)” juar) 


if d4x(4 
Zij] = J AL a (6.26) 
The integrand of the above spacetime integral can be regarded as the effective 
lagrangian of the theory, and the gauge-fixing term appears as one of its contri- 
butions. At this point, the functional integration can be carried out with impunity 
to obtain 


Z[ja] = Z[Ole~2 [dix fdty JODE (yi) (6.27) 
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where D%” is defined as 
(0,8 — (1 — &7')a# ar) Dr — y) = SSO — y). (6.28) 


We find in this way the form of the photon propagator in an arbitrary gauge, as 
appearing in Eq. (II-1.17). 


Ghost fields 


In the path-integral formalism, if the generating functional can be written in purely 
exponential form, then one can read off the lagrangian of the theory from the 
exponent. However, the general formula in Eq. (6.18) for a gauge-fixed generat- 
ing functional contains a seemingly nonexponential factor, the determinant factor 
det |5G,/5a,|. A fruitful procedure, due to Faddeev and Popov, for expressing the 
determinant as an exponential factor is motivated by the identity (cf. Eq. (5.15)), 


det M = N J [dc][dé] is, (6.29) 


where c, c are Grassmann fields. This identity suggests that we replace the deter- 
minant factor with an appropriate functional integration over Grassmann variables. 
For QED, the generating functional can then be written in the concise form 


ZLiul =N J [dA Mdc]{de] of fas Anlst") Ava OF AnP ETCH jA) 
(6.30) 


As pointed out earlier, for this case the integration over c, c yields only an unim- 
portant constant and may be discarded. However, for nonabelian gauge theory 
Eq. (6.30) generalizes to 


zij = / [[taatitac yaar et [Eriam aA] (6 31) 


a,b,d 
where repeated indices are summed over. The quantities 


me 6.32) 

ba’ 
will generally depend upon the fields A’, themselves. Thus, the fields {ca}, {ca} 
will appear as degrees of freedom in the defining lagrangian of the theory. How- 
ever, although coupled to the gauge fields A’, through cMc, they do not interact 
with any source terms and therefore can only appear in closed loops inside more 
complex diagrams. Since these Grassmann quantities are unphysical, they are 


5 Such loops must include a multiplicative factor of —1 to account for the anticommuting nature of these 
variables. 
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often called Faddeev—Popov ghost fields. They are scalar, anticommuting variables, 
which transform as members of the regular representation of the gauge group, e.g., 
for the gauge group SU (n), there are n? — 1 of the {ca} and {c4} fields. 

To complete the discussion, let us determine the ghost-field contribution to the 
QCD lagrangian. We choose F, = 3, A), and note the form of a gauge transforma- 
tion (cf. Eqs. (I-5.12), (I-5.17) with œa infinitesimal), 


/ 1 
Ap > Aj’ = A, + —d" ay — foae Akae. (6.33) 
83 
Then we find from a direct evaluation of 0F;/da, followed by the rescaling 
—g, tc > Če, 


Lon = —Cp0y[dp00” > 83 fbae Aa |Ce- (6.34) 


Upon performing an integration by parts in the first term and relabeling the indices 
in the second, we obtain the ghost contribution to the QCD lagrangian of 
Eq. (II-2.25). 


Problems 


(1) The van Vleck determinant 
The semiclassical approximation to the propagator (valid as A — 0) can be 
derived by expanding about the classical path. Writing 


X(t) = Xq(t) + ôx (t), 
we have 


, ï , 82S i 
D(xf, tf; Xi, ti) = dmi omoi EO aan O, 


where 


525S a ( 32 = rean) Sar) 
dx(t)dx(t’) P 3x4) 


and we have dropped the term linear in ôx(t) by Hamilton’s condition. 
Performing the path integration we have then 


825 —1/2 
D(x, tr; Xij; ti) = N | det ——————_ ei Sha 
fot. 5x(Hdx(’) 


where N is a normalization constant and the quantity inside the square root is 
called the van Vleck determinant. 
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(a) Show that this can be written in the form 


PS P 71 ska” 
Feoxal -| Ax Ox; | 


Hint. The following argument is hardly rigorous but leads to the correct 
answer. Write 


Dx ptf xi, ti) = AG Xi; tf — tJe ere) 
and use completeness to show that at equal times 
(xf = xi) D (xf, ti; Xi, ti) 


= fax A(xp, x; T)A* (xi, x; Te Sap T) -8a ix; T) 


where T is an arbitrary positive time. Now define po(x;,x; T) = 
OSa(x;, x; T)/ðx; so that 


Salty, x; T) — Sa, x; T) = ay — xip i, x; T). 


Finally, change variables from x to p and compare with the free particle 
result to obtain 


at r= | 28a di 
Xf, Xi; = eee . 
: 211 OX £OX; 


(b) Show that 


wm 


Sup wet) SETS im dx,/2m(E — V(x) 


Xi 


and verify that 


1 8Sa V m ma 
ÈE ia] ~ aoe (4) Si dx =a 
Hint: Recall that t is an independent variable, so that 
ot ot 
= T = an 
We thus have the result for the semiclassical propagator 


1/2 
Dap tps x,t) = | ————"__ — eisa, 
— 2mixea(tira(ty) V dx Xa") 


which is identical to that found from WKB methods. 


Problems 519 


(2) Propagator for the charged scalar field 
The lagrangian for a charged scalar field g of mass m and charge e in the 
presence of an external (c-number) potential A,, is 


L = D*¢* Dig — mo*9, 


where D, = 0, +ieA, is the covariant derivative. 
(a) Show that the full Feynman propagator, 
_ fldglldg*loa')e* ei as £0 


Dr(x 3x) = Tidolldo Jei TTO 7: 


can be written as 
Dp(x'; x) = —i(x’|(D" D, + m? — ie)! |x). 


Suggestion: This is a quadratic form. Use the generating functional to 
integrate it. 

(b) By expanding Dpr(x’;x) as a power series in A„(x), show that an 
alternative representation for the propagator is 


ioe) 
. 2 . 
Dr(x’; x) Aen wif ds eis D" Du+m =i ey, 
0 


(3) Functional methods and gf theory 
Consider a scalar field theory with the self-interaction 


À 
Lint = — 7? 0). 
(a) Show that the generating functional can be written as 
oh 4 
Zj] = Ne" Tela fatx fdty jwiAray jo) 


where the free field Feynman propagator i A p(x, y) is as in Eq. (C—2.12). 
(b) Evaluate the two-point function to O(A7). Associate a Feynman diagram 
with each term of this expansion and separate the connected and discon- 
nected diagrams. 
(c) Calculate the connected generating functional via 


jà PRA : f 
W[j] = Wolj] — i In f + ei WOU] (04 — i) | 
where 
1 ae 
WO = fefe i(@)iAr(x, IO). 


(d) Compare the connected diagrams found in parts (b) and (c). 


Appendix B 
Advanced field-theoretic methods 


B-1 The heat kernel 


When using path-integral techniques one must often evaluate quantities of the form 
H (x,t) = (x |e? | x}, (1.1) 


where D is a differential operator and t is a parameter. In this section, we shall 
describe the heat kernel method by which H (x, Tt) is expressed as a power series 
in t. For example, if in d dimensions the differential operator D is of the form 


D=O+4m'+V, (1.2) 


where V is some interaction, then the heat kernel expansion for H (x, T) is 
2 
=m 


H(x,t) = a ae [ao(x) +a(x)t +an(x)t*> +++], 1.3) 


where a;(x) are coefficients which will be determined below. 
Let us begin by citing the two most common occurrences of H (x, t). One is in 
the evaluation of the functional determinant 


det D = ett nD = ef dx Tr(x|InD|x) (1.4) 


where ‘Tr’ is a trace over internal variables like isospin, Dirac matrices, etc., and 
‘tr’ is a trace over these plus spacetime. The (generally singular) matrix element 
(x| In D|x) appearing in Eq. (1.4) can be expressed in a variety of ways. For exam- 
ple, in dimensional regularization one can use the identity 


in? = i N (et eh) (1.5) 
a 0 Xx 
to write 
ear —tD 
(x| In D|x) = -f — (x |e |x) +C, (1.6) 
0 
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where C is a divergent constant having no physical consequences. Substituting 
Eq. (1.3) into the above yields 


E a, a d 
(xlinDhx) -C= -a Do 2 T(n- 5) anc (1.7) 


The divergences in the series representation arise from the I function and are 
restricted in four dimensions to the terms do(x), a, (x), ao(x). 

The heat kernel can likewise be used to analyze the functional determinant in 
alternative regularization procedures, such as zeta-function regularization. Here, 
one expresses the matrix element (x| In D|x) as 

i 


d 
ae In a 
ds s=0 


~ fajli od 
~ato h a 


Ds 
p(x, s) = zzl dtt® 'H(x,t). (1.8) 


(x|InD|x) = — (x 


The penultimate equality in Eq. (1.8) is obtained from repeated formal differentia- 
tion of Eq. (1.6) with respect to D. Upon expanding the H (x, t) term in p(x, s), 
one arrives at the desired power-series expansion of (x|InD|x). This usage is 
applied in the next section. 

The other main use of the heat kernel is in the regularization of anomalies. Often 
one is faced with making sense of Tr (x |O(x)|x), where O is a local operator. 
Although such quantities are generally singular, they can be defined in a gauge- 
invariant manner by damping out the contributions of large eigenvalues, 


Tr (x |O(x)| x) = lim Tr (x |O@)e~ | x), (1.9) 


where D is a gauge-invariant differential operator. Again, it is only the low-order 
coefficients, generally those up to a2(x), which contribute in the € — 0 limit. We 
employ this technique in Sects. I-3,4. 

As an example of heat-kernel techniques, let us consider the following operator 
defined in d dimensions: 


ð 
D = d,d! +m + 0 (x) (4, = aa + Tuto) ; (1.10) 
x 
where T(x) and o (x) are functions and/or matrices defined in some internal sym- 
metry space. In particular, neither T, nor ø contains derivative operators. Employ- 
ing a complete set of momentum eigenstates {|p)} allows us to express the heat 
kernel as 
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d'p —ip-x —tD ip-x 
He n= | oe Pie ee, (1.11) 
where in d dimensions use is made of the relations 
n 1 ip-x 
(plx) = Qran! ; 
dp u 
(xIx) = J PE = 8-2), 
d4x “7 ot 
, _ i(p'—p)yx — gdp! _ . 1.12 


From the identities 


dye'?* = e?*(ipy + dy), 
d,d"e'?* = e'?* (ip, + d,)Gp" + d”), (1.13) 


we can then write 
d 
H(x T) — J d P eo tL Gputdy)’ +m +o | 
(27)? 


d‘p 


= t[p?—m?] ,—t|d-d+o+2ip-d] 
~ | Qr)! E . (1.14) 


The first exponential factor is simply the free field result, while all the interesting 
physics is in the second exponential. The latter can be Taylor expanded in powers 
of t, keeping those terms which contribute up to order t? after the integration over 
momentum is performed. Note that each power of p° contributes a factor of 1/t. 
Thus, we obtain the expansion 


H(x,t)= J a: gewij —t|d:-d+0] 
? (27)! 


2 
+E- d+od-d+0)-4p-d p-d] 


4r? 
+a [pd p-dd-dt+o)+p-dd-d+o)p-d 


+(d-d+o)p-dp-d| 
16t* 


a 


ped ped ped pd +-~| (1.15) 


where terms odd in p have been dropped and we have displayed only those © (t°) 
and O(t) terms which contribute to H at order t? after p is integrated over. 
To perform the integral, it is convenient to continue to euclidean momentum 
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Pe = {P1, P2, P3, pa = —ipo}. Then, with the replacement p, p“ > -|p phl = 
— Po: we obtain 


d! pe (p2 2 dQa 2 2 
—(petm*)t _ d d-1 ,—(pyp+m*)t 
Onyi au nee © 
ant? 1 e™™T(d/2) 
~ T(d/2) (27)! 2r4/2 


2 


_ 1 el” T 
_ (4s) 4/2 4/2’ 
d v —m?t 
d DR ep ige 8 1 We TUEN 
(27 )4 d (4m)! t1! T(d/2) 
7 bev gomt 
Ta (4m )1/2r4/21 
d’ pr Zam emt 
—(petm’)t pH vy A Lo __ 
[oe E PEPEPEPE = (4s) 4/27 4/242 
(86H80 J- ELA Sve 4 57 52”) 
x . (1.16) 
4 
Employing these relations to evaluate Eq. (1.14) gives (to second order in T), 
m B je"? 
(x, t) = (47 )4/274/2 
x |l- to +r’ Pec d,|[d" a] 4+ tid [d“,o]] 
2 12 H’ v ’ 6 [bo ’ e 
(1.17) 
or in the notation of Eq. (1.3), 
ao(x) = 1, a\(x) = —o, 
1,,1 p 1 
aj =a + i ddd TA = [dy [d", ol]. (1.18) 
2 12 6 
Fermions are treated in a similar manner. For example, the identity 
1 
Inp = 5 In@D) (1.19) 


allows the same technique to be used for the operator J J. In particular let us 
consider the case where 


B=st+ivV+irys. (1.20) 
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With some work, one can cast this into the form of Eq. (1.10) with the identifica- 
tions 


DP =D = d,d" +0, 
d, = 06 +i Vp, aX eH ðp + Tus 


1 
O = 50 Vl" — 2A, AM + (i8, A" — [Vu AMT) ys, 
Vav = ða Vo — ð Va + ilVa, Vy] + ELA, Av]. (1.21) 


The values of a; (x) appearing in Eq. (1.18) can also be used in this case. The heat- 
kernel coefficients have been worked out for more general situations [Gi 75]. 
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In this section, we illustrate the method described above while also proving an 
important result for the theory of chiral symmetry. The goal is to demonstrate that 
all the divergences encountered at one loop can be absorbed into a renormalization 
of the coefficients of the O(E*) chiral lagrangian and to identify the renormaliza- 
tion constants. The technique used here, the background field method, is of consid- 
erable interest in its own right [Sc 51, De 67, Ab 82] and is applicable to areas such 
as general relativity [BiD 82]. 

The basic idea of the background field method is to calculate quantum correc- 
tions about some nonvanishing field configuration @, 


p(x) = px) + 69), (2.1) 


rather than about the zero field,’ and to then compute the path integral over the 
fluctuation ôg (x). The result is an effective action for ø. This effective action can 
be expanded in powers of Y and applied to matrix elements at tree level, resulting 
in a description of scattering processes at one-loop order. In the case of the chiral 
lagrangian, one expands the full chiral matrix 


U =U +60, (2.2) 


where U satisfies the classical equation of motion. Upon integration over ôU, one 
obtains the one-loop effective action for U. This contains a great deal of informa- 
tion. In particular, U can be expanded in the usual way in terms of a set of external 
meson fields 


U =exp(ia"@"/F) (a=1,...,8). (2.3) 


1 See the discussion in Appendix A-4. 
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Contained in Sos(U) is the effective one-loop action for arbitrary numbers of meson 
fields. Upon identification of renormalization constants, all processes become renor- 
malized at the same time. 

Our starting point is, in the notation of Sect. IV-6, the O (E?) lagrangian 


Fy f Fo F 7 
{y= Tr (D,,U D“U") + a (x'U+U'x). (2.4) 


The procedure to follow is rather technical, so let us first quote the end result of 
the calculation. Upon performing the one-loop quantum corrections, the effective 
action will have the form 


Sete = gn 4 gron 4 aes oer 


Here the lagrangians in S5", S;" are the ones quoted in Sect. VII-2, but now with 
renormalized coefficients. In particular S'°" is the sum S1™ = SP*° + SY where, 
in chiral SU (3) and employing dimensional regularization, S{'Y is given by 


si == | d| [Tr (D,UD“U')P 
Him Duo wv +09 
+ = Tr [D.UD“Ut (xut + Ux")] 


11 ; 2 5 l 
a [Tr (xU +Ux")] + ra (xU'xU' + Ux'Ux’) 


j 1 
+ zT (LyyD"“U DUT + Ry, D*UŻÝD”U) — z" (Ewu RU) 
(2.5) 
with 
1 


==> 
327? 


[Z inary]. (2.6) 
The terms in SY are all of the same form as the terms in the bare lagrangian at 
order E*. Therefore, all the divergences can be absorbed into renormalized values 
of these constants. The finite remainder, hy cannot be simply expressed as a 
local lagrangian, but can be worked out for any given transition. When S¢"" is added 
to the O(E*) tree-level lagrangian of Eq. (VII-2.7), the result has the same form 
but with coefficients 


L; = Li — yA, (2.7) 
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Table B—1. Renormalization coefficients. 


i i 2 4 5 6 8 9 10 

1 1 1 1 3 1 1 

SUD n wm è 8 3 34 BH 0 0 GE Hs 
3 3 1 3 11 5 1 1 

SUBG) Vi 2 6 8 8 a B® 4 74 


where the {y;} are numbers which are given in Table B—1 for both the case of chiral 
SU(2) and SU(3). Thus, the divergences can all be absorbed into the redefined 
parameters and these in turn can be determined from experiment. Let us now turn 
to the task of obtaining this result. 

In applying the background field method, there are a variety of ways to para- 
meterize 5U, and several different ones are used in the literature. The prime con- 
sideration is to maintain the unitarity property UU = 1 = (U T+ 6U +) (U + 6U ) 
along with UŻU = 1. We shall take 


U =Ue', (2.8) 


with A = åA" representing the quantum fluctuations. This choice is made to 
simplify the algebra in the heat-kernel renormalization approach, which we shall 
describe shortly. Another possible choice is 


U =éel"é (2.9) 


with n = A“n@ and ££ = U. These two forms are related by n = £ A&". Since in 
the path integral, we integrate over all values of A (or n) at each point of spacetime, 
these two choices are equivalent. 

The expansion of the lagrangian in terms of U and A is straightforward, and we 
find 


Tr (D, U D“UŤ) = Tr (D ŪD“Ū*) — 2i Tr (OD, TD" A) 


+ Tr [D A D"A + 0'D,0 (aba — D"A A)|, 
Tr (xU + Utx) = Tr (xU + Ux) +i Tr (A (xU — U+#x)) 
2 ST [a? (xt + Utx)], (2.10) 
where 
D,A=d,A+i[r,, A], (2.11) 


where r, is the matrix source function of Eqs. ([V—6.1,6.2). Since U satisfies the 
equation of motion, there is no term linear in A. One may integrate various terms 
in the action by parts to obtain 
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(0) 4 7 Fo a ab yb 
Sy = | dx į LU) -A (dud" +o)" A? +--+}, (2.12) 
where 
de = 50, +8, 
1 B _ _ _ 
re =a ([a%, 4°] (U'd,U +iU'e,U +ir,)), 
1 ee _ 2 _ _ 
ot = git ({A7, 4°} (x'U + Ux) + [>t UDU] [A?, UT DUJ). (2.13) 


The action is now a simple quadratic form, and the path integral may be per- 
formed. The only potential complication is the question of interpreting the integra- 
tion variables. This is referred to as the ‘question of the path-integral measure’. The 
integration over all the unitary matrices U can be accomplished by an integration 
over the parameters in the exponential 


fia = n fiaa“, (2.14) 


where N is a constant which plays no dynamical role. With this identification one 
obtains 


: , 2 b 
e! Wioop = [dare fates etaa AD 


= (det [d d" 4 of)” = e73" In(dyd* +0) (2.15) 
Here ‘tr’ indicates a trace over the spacetime indices as well as over the SU(N) 
indices a, b. 

The identification of divergences is most conveniently done by using the heat- 
kernel expansion derived earlier in App. B—1, where it is shown that all the ultra- 
violet divergences are contained in the first few expansion coefficients. The relevant 


terms are 


Wioop = 5 tt In (d,d + 0) 


1 4 : d d—2 
= zayn | eim fr (1- 5) m Tro 


d | 1 
+ mT (2 = 5) Tr (re + 57°) Bhat | , (2.16) 


where 


ab __ ab ab acpcb acpcb __ ab 
ra = p — TP TTP — TTY = [da da]. (2.17) 
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For N+ flavors, the operator part of the first term in Eq. (2.16) is 


N: TSE NZ-—1 2! 
Tro = Fa (D,0 D*U") + oa (xi + Uy). (2.18) 
F 


The above two traces are just those which appear in £2; as such, they can only 
modify the quantities Fy and m2. The remaining terms can be worked out with a 
bit more algebra. Using the identity 


ð, (U'a,U) — a, (U'a,U) = — [Uta U, UA, U], (2.19) 


we find for the field strength, 


pab = ar [|as x |([0t pð, ODO] +iO' LO +iRw)}. 220) 


This produces, for N flavors in chiral SU (N$), 
Tr (TaT) = Ltr ((0'D, 0, 010.0] [0'0 OtD*d) 
+ iN, Tr (Rað UU + Lað UOU) 
— N; Tr (Lw UR” U+) — “r Tr (LL + Ru R””), 


Tro? = - [Tr (D,UD"U")} + FT (D,ŪD,Ū') Tr (D“ŪD”Ū') 


oo| = 


zoa aaan FN ye 
+ M (D,,U D*U*D,Ū D*U*)+ a [Tr (x0 + tx) Ý 
i 


+ FT (D,,U D“U") Tr (xU' + Ux’) 


+ Mr (D,U D*Ut (xU' + Ux')) 


f 
8N; 


+ Tr ((xU' + Ux") (xUT +Ux`)). (2.21) 
The only operator which is not of the same form as the basic O(E*) lagrangian 
occurs in the first term of Tr T?. However, by use of Eq. (VII-2.3) for SU (3), it 
can be written as a linear combination of our standard forms. For Ny = 3, these 
add up to the result previously quoted in Eq. (2.5). Here, the divergence is in the 
parameter A. For convenience in applications, we have added some finite terms 
to the definitions of A in Eq. (2.6). The results for Ny = 2 are also quoted in 
Table B—1, although some of the operators are redundant for that case. 

The reader who has understood the above development as well as the standard 
perturbative methods presented in the main text will be prepared for the use of 
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the background field method in the full calculation of transition amplitudes. This 
procedure consists of writing 


d,d" +0 = Do + V 
Do = O + m? 
V = {ð T} +r,“ +0 - m, (2.22) 


where m? is the meson mass-squared matrix. The one-loop action is then expanded 
in powers of the interaction V 


Woop = str ln(d,d" +0) = str [In Do + In(1 + Dy 'V)] 


1 
=; [indy o'v -prvo v+ |. (2.23) 


The first term is an uninteresting constant which may be dropped, and the remain- 
der has the coordinate space form 


i 


Wioop = -i fats Tr [AF (x — x) V(x)] 


— i J d‘xd*yTr[Ar(x — y)V(y)Ar(y VOH (2.24) 


When the matrix elements of this action are taken, the result contains not only the 
divergent terms calculated above, but also the finite components of the one-loop 
amplitudes. The resulting expressions are presented fully in [GaL 84, GaL 85a]. 
This method allows one to calculate the one-loop corrections to many processes at 
the same time and, in practice, is a much simpler procedure for some of the more 
difficult calculations. 


B-3 PCAC and the soft-pion theorem 


We have emphasized the use of effective lagrangians to elucidate the symmetry 
predictions of a theory. For a dynamically broken chiral symmetry such as QCD, 
these predictions will relate processes which have different numbers of Goldstone 
bosons. The machinery of effective lagrangians will correctly yield such predic- 
tions, but it is often useful to have an alternative technique for understanding or 
calculating these results. In the case of chiral symmetry, this is provided by the 
soft-pion theorem, which explicitly relates a process with a pion to one with that 
pion removed from the amplitude. Calculations performed this way uses current 
algebra methods which go by the name of partial conservation of the axial cur- 
rent or PCAC [AdD 68]. While these techniques are often more cumbersome, they 
often are useful. This section describes these methods. 
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We can again turn to the sigma model to introduce this subject. We return to 
the effective lagrangian treatment of Chap. IV, with a pion mass included and the 
S field integrated out. The lagrangian of Eq. ([V—6.12) gives rise to the vector and 
axial-vector currents 


ar 
Vi = iz T [t! (U'A,U + Ud,U")], 
2 
A = iz Tr [c* (Uta,U — Ua,U")], (3.1) 


with k = 1, 2, 3. The equation of motion is found to be 


2 
a" (Uta, U) + (U - U’) =0, (3.2) 


and two important matrix elements are 
(0 [Af |m) = ivp, (0 |“ AF | 27 (p)) = vm”. (3.3) 


The former allows the identification v = Fy, where F» is the pion decay constant 
Fx = 92 MeV, while the latter follows either from Eq. (3.1) directly, or by use of 
the equation of motion for Ai, 


av Ak E [t* (U —U')] = Fem? at +- (3.4) 
uo 4 EAT . . 


This last equation forms the heart of the PCAC method. It describes a situation 
covered by Haag’s theorem (recall Sect. IV—1), and says that we may use either 
m* or aM AN (properly normalized) as the pion field. It is more general than the 
sigma model, which we used to motivate it. This, plus certain smoothness assump- 
tions, gives rise to a soft-pion theorem for the following matrix element of a local 
operator O, 


Jim, (*(@)B1Ola) = — 5 (BI [95, O] la), (3.5) 


i 

Fy 
where £, œ are arbitrary states and Q% = f ax Ak (x) is an axial charge. 

The proof of Eq. (3.5) starts with the LSZ reduction formula. We consider the 


matrix element for the process a —> f + 2*(q) as the pion four-momentum q is 
taken off the mass-shell, 


(D10 Ola) =i f ax e'1* (D + m=) (BIT (z*Œ)0(0)) la) 


=i fats e'f*(—q? + m2) (BIT (x*(x)O(0)) la), (8.6) 
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The pion field can be replaced by using the PCAC relation (valid in the sense of 
Haag’s theorem), 
k 1 


daa av At, (3.7) 


leading to 


2. gt 
— J d*x e'T™ (BIT (Ə A; (x)OO)) la). (3.8) 


(x(q) B10 Ola) = i= 


The derivative can be extracted from the time-ordered product by using 
a” (BIT (At (x)O(0)) la) 
= (BIT (0 Al (x) 0 (0)) læ) + 8 (xo) (B| [AG@), OC] |e), (3.9) 


where the last term arises from differentiating the functions 0(+x9), which occur 
in the time-ordering prescription. Upon integrating by parts, we find 


k _ se (m E q’) 4 iq-x k 
(x(q) B|OO)|a) = 12 d*x e'1™ [—(B| [Ag@), 0(0)] la) 8 (xo) 


— ig" (BIT (A£ œ) 0 0)) la)] . (3.10) 


Up to this stage all the formulae are exact for physical processes, even if appearing 
rather senseless, since ə AF, has the same singularity for q? —> m? as does the 
field zt. However, to obtain the soft-pion theorem one assumes that the matrix 
element does not vary much between its on-shell value and the point where the 


pion’s four-momentum vanishes. In that circumstance, we have [NaL 62, AdD 68] 


i E 
Jim (m @210la) = -7 (PI [25 00] la) + lim iq" Ry, 68.11) 
where 
R= -5 fas e'1* (BIT (Ai (x)O(0)) |a). (3.12) 


The remainder term of Eq. (3.11) vanishes unless R! has a singularity as q“ —> 0. 
Such a singularity can occur if there are intermediate states in Rf which are degen- 
erate in mass with either œ or 6. This last statement can be proven by inserting a 
complete set of intermediate states in the time-ordered product in Ri; and taking 
the q” — 0 limit. This caveat should be kept in mind as it is sometimes relevant. 
The soft-pion theorem relates to the intuitive picture for dynamically broken 
symmetries mentioned in Sect. I-6. Since a chiral transformation corresponds in the 
symmetry limit to the addition of a zero-energy Goldstone boson, we expect the 
states (£| and (17, -oB | to be related by the symmetry and, indeed, the soft-pion 
theorem expresses this. Although the soft-pion theorem is exact in the symmetry 
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limit, a smoothness assumption is needed in the real world to pass from q, = 0 to 
q? = m}, implying that corrections of order q, or of order m2. can be expected. 

In the Standard Model, the charge commutation rules are commonly abstracted 
from those of the quark model. Upon expressing charge operators in terms of quark 
fields, 


k 


k 37 ak k 4.7 ae 
= ax pz Q; = dx bps Y, (3:13) 


one obtains the algebra 


[vlag Te [5 Va] = iF" An 
[CO Ai] = if" Ak,  |On.A aay. (3.14) 


These commutation rules can be extended to equal-time commutators, which con- 
tain a charge density, e.g., 


[Vo (2), ALO] o0 = Uf FAL SO (x — y). (3.15) 


However, commutators which involve two spatial components can be more 
problematic [AdD 68]. 

Sometimes, in the PCAC approach, if the matrix element is assumed to be 
strictly constant, the various soft-pion limits turn out to be contradictory. If so, 
the amplitude must be extended to include momentum dependence, as happens in 
nonleptonic kaon decay. By contrast, the effective lagrangian approach automat- 
ically gives the appropriate momentum dependence, and its predictions follow in 
a straightforward manner. Moreover, effective lagrangians are especially useful in 
identifying and parameterizing corrections to the lowest-order results. They allow 
a systematic expansion in terms of energy and mass. 


B-4 Matching fields with different symmetry-transformation properties 


In Chapter IV, we described the construction of an effective lagrangian for pion 
fields with chiral transformation properties. However, most particles do not trans- 
form in the same way as the pions of that chapter. In a broader context, we require 
a procedure for combining fields with different symmetry properties. For example, 
in the case of hadronic physics one often needs to consider particles such as nucle- 
ons, ¢(770), etc., interacting with pions. A general approach for this was presented 
in a set of classic papers on the subject [We 68, CoWZ 69, CaCWZ 69]. We shall 
introduce this framework by again referring to the sigma model, and then we shall 
extend the results. 


B-4 Matching fields with different symmetry-transformation properties 533 


Heavy particles do not themselves exist in chiral multiplets. For example, the 
chiral partner of 0(770) would be the JPC = 1** state a(1260). The a(1260)— 
(770) mass difference is considerable, and attempts to pair these particles in a 
chiral multiplet would clearly be a matter of speculation. However, since each falls 
into vectorial flavor (SU (2) or SU(3)) multiplets, it makes sense to build in only 
vectorial flavor invariance without invoking assumptions about chiral properties. 

We shall proceed by first working out an example, the fermionic sector of the 
linear sigma model, 


Lr=wWlip—-g(o—it-nys)lv 
- _ S\ _ 
= Wiipw. + Wripr — gv (1 + 5) (YLUYr + YrU YL). (4.0) 


We shall drop reference to the scalar field S in the following. The above lagrangian 
is invariant under the chiral transformations 


YL > Lyu, Wr —> Rwr, U> LUR, (4.2) 


with L in SU (2); and R in SU(2)p. As always, we are free to change variables 
via contact transformations. In this instance, a useful choice of field redefinitions 
turns out to be 


N, = tY, Ng = EWR, U=€6&, (4.3) 


where € = exp(it -mx /2Fx). This is seen, after some algebra, to convert the fermion 
lagrangian to 


Li, =N(iP—-Ays—M)N, Dy =9.+iVu, 
j (4.4) 


= i . — l js 

Va = E (60,8 + £0,8") > A= 72 G LAA) , 
which is a theory of fermions of mass M = gv having pseudovector coupling. The 
new fields transform as 


E> LEVi=VER', N, > VN, Nr —> VNp, 


_ a a tah (4.5) 
Va > V (Va -ið V- V) V’, Ay >VA,V', D, N >VD,N. 


For purely vector transformations we have L = R = V. For L Æ R, the property 
of V is more complicated, and Eq. (4.5) implies that it cannot be a simple global 
transformation, but must be a function of x (x) and hence a function of x. At first 
sight, the need to express an SU(2)-transformation matrix like V as a function 
of x(x) appears unnatural. However, it is in fact consistent with physical expec- 
tations. Recall from the general discussion of dynamical symmetry breaking in 
Sect. I-6 that, in the symmetry limit, axial transformations mix the proton not 
with the neutron (as in isospin transformations) but rather with states consisting of 
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nucleons plus zero-momentum pions. Mathematically, the important point is that 
N; and Ne transform in an identical fashion. This corresponds to the fact that 
heavy fields do not transform chirally, but have a common vectorial SU (2) trans- 
formation. It can be directly verified that Eq. (3.5) is a symmetry of the lagrangian. 
Thus, we have obtained the expected result that the baryons can have a vectorial 
SU (2) invariance, while maintaining a chiral invariance for pion couplings. 

We see in the above example the ingredients of a general procedure for adding 
heavy fields to effective chiral lagrangians. The heavy fields are assumed to have an 
SU (2) (or SU (n), if desired) transformation described by the matrix V. A deriva- 
tive ð, acting on a heavy field must be incorporated as part of a covariant deriva- 
tive D,, in order to maintain this invariance. Couplings to pions are described by 
the matrices € and U, with € having the same transformation as in Eq. (4.5). It 
is usually straightforward to combine factors of € and U in such a way that the 
overall lagrangian is invariant. In the general case, each invariant term will have an 
unknown coefficient which must be determined phenomenologically. For example, 
the N Ays N term in Eq. (4.4) would be expected to have a coefficient different 
from unity; the unit coefficient is a prediction specific to the linear sigma model. 
Effects which break the symmetry in an explicit fashion, like mass terms or elec- 
troweak interactions, can be added by using appropriate external sources. To date, 
heavy-field lagrangians have been used in applications primarily at tree level. The 
feature which is essential for their application is that the pion momenta are small, 
and hence the heavy fields are essentially static. 


Appendix C 


Useful formulae 


C-1 Numerics 


Conversion factors (A = c = kg = 1): 


1 GeV~! = 6.582122 x 10° s 1GeV=1.16 x 10! K 
= 0.197327 fm = 1.78 x 107%” g. 


Physical constants (A = c = 1): 


G,, = 1.1663787(6) x 1075 Gev~? Gy? = Mp = 1.2 x 10° GeV 
a! = 137.035999074(44) sin? 6MS (M7) = 0.23125(16) 
mw = 80.385(15) GeV mz = 91.1876(21) GeV 
me = 0.510998928(11) MeV mp = 938.272046(21) MeV 
F, = 92.2(2)MeV Fx = 110.4(8) MeV 
(yi ( = 2.232(11) x 107° Inoo| = 2.220(11) x 107°. 


CKM matrix elements: 


[Vaal = 0.97427(15)  |Vas| = 0.22534(65) | Vay] = 0.00351 75. 00013 


[Veal = 0.22520(65)  |Ves| = 0.97344(16) | Ven] = 0.0412+0:0011 


[Veal = 0.008674000007? |V| = 0.040450 ios [Va] = 0.999146+0:000071, 


C-2 Notations and identities 


Metric tensor: 


gt, =4. (2.1) 


og 
= 
II 
ooor 
(= 
| 
_ 
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Totally antisymmetric four-tensor: 


+1 {, v, a, B} even permutation of {0, 1, 2, 3} 
eH" — {t —1 odd permutation 
0 otherwise 


a eee = git ge” ake + g” gP g Ba’ te” g% a gh 
— gi gat gP — gP g gba’ _ g gP gP, (2.2) 
Totally antisymmetric three-tensor: 
+1 {i, j,k} even permutation of {1, 2, 3} 
€ijk = 4 —1 odd permutation 
0 otherwise 
eviik 
€ijk€ilm = Ô j18km = Ô jmôki- (2.3) 


ijk 
= —€0ijk = €" " = €ijk 


Pauli matrices: 
a'o" = ÎI + ieo! (j,k, 1 = 1,2,3) 
d- = 26 ad9be — Ôabôcd (a, b, C, d = 1, 2). (2.4) 


oipo 


Dirac matrices: 


ys=—iy®y!y*y? 


a 
= 
II 


$ 
a 

yey y® = g” y" + g” y" — gy” — ich yeys 
VTi? = s Cale y“ys, o”) 


yTiy =- T=). (2.5) 
Trace relations: 
Tr (y“)=0 
Tr (ys) =0 
Tr (y*y") = 4g” 
Tr (y“y”ys)=0 
Tr (y#y”y"yf) = 4 (gg — ght gh + gh? ova) 


rey yyy) = ~ 
Tr (hi.. Fonsi) = 
Tr (di. fan) = Tr (fon ERA fi ) . (2.6) 


C-2 Notations and identities 537 


Plane wave solutions: 

The Dirac spinor u(p, s) is a positive-energy eigenstate of the momentum p and 
energy E = yp? +m’. Antifermions are described in terms of the Dirac spinor 
v(p, s). The adjoint solutions are denoted by u = uty? and ọ = v'y®. Note that 
our normalization of Dirac spinors behaves smoothly in the massless limit. 


(pb — m)u(p, s) = 0 
u(p, s)(p —m) =0 
(p + m)v(p, s) = 0 
v(p, s)(p +m) =0 
u(p, r)u(p, s) = 2mô,s 
v(p, r)u(p, s) = —2mô,s 
u` (p, r)u(p, s) = 2ES;s 
v'(p,r)v(p, s) = 2Eô,s 
J up, sap, 5) = p+m 


sS 


J op sip. s) =p- m. (2.7) 


S 


Gordon decomposition for a fermion of mass m: 


(+e) ieo), 
2m 


2m 


i (p', r) y“u(p, s) = ū(p',r) ) u(p,s). (2.8) 


Dirac representation: 


o fi © (0o „jO SA 
PoG 73 aa) | OO 


o-p 
Xs Xs 
u(p,s)=JVE+m o-p v(p,s) =VE+m E+m 
Xs Xs 
E+m 


(2.10) 
Fierz relations: 
The anticommutativity of fermion fields and the algebra of Dirac matrices imply 
the (particularly useful) Fierz relations, 
Wir" A + ys)datsyu(l + ys)ya = Wir" A + ys) Yapay (l + ys)y2 
piy A + ypy — ys) ya = -240 — ys) yap + ys)¥2. 11) 
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Propagators: 
The propagators associated with fields p(x), w(x), W, (x) having spins 0, 1/2, 1 
and masses u, m, M are, respectively, 


d'p g-iPx i 

(27) pP- pete 
d'p Pa i (p + M) pa 
(27)4 p — m +ie 


iA p(x) = (IT ee |0) = J 


iSrgal(x) = (OIT (We(x)We(0)) 10) = / 
i Drix) = (O|T (Wœ) W (0)) 10) 


d’ py i (8w + v/ (P — EM? +i 
_ P pips 18 + (1-5) PrP / (p £M? + ie) (2.12 
(27) p — M? +ie 

where & is a gauge-dependent parameter. 
Feynman parameterization: 

1 T 1 n—1 i= m—1 

_ (n +m) d xen x) ee 
a"b™ T(n) (m) Jo [ax + b(1 — x)]"™ 

1 l i 1 

—=2/ xax f dy—— a (2.13) 

abc 0 o [ad — x) + bxy + cx(1 — y)] 


C-3 Decay lifetimes and cross sections 


Parameters of choice for quantum fields: 

The literature reveals a variety of conventions employed in quantum field theory. 
We can characterize all of these with certain parameters of choice, Jj, Ki, Li 
(i = B, F distinguishes bosons from fermions), occurring in the normalization of 
spin zero and spin one-half fields, 


3 
g(x) = J T (atkye** + al (kye"**) 


B 


d? a ; 
ioes J F_ (OPU, s) + d'p, swp e), GD 


in momentum space algebraic relations, e.g., 
[a(k), a` (k')] = Kgô’(k — k’), 
[bP. r), b'P, 8) } = Krô,sô* (p — p’), (3.2) 
and in the normalization of single-particle states 


|k) = Lga'(k)|0), Ip, s)r = Leb’ (p, s)|0). (3.3) 
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It is convenient to introduce an additional parameter Np to characterize the choice 
of fermion spinor normalization, 


u` (p, r)u(p, s) = Nr2Epbrs. (3.4) 
For uniformity of notation, we also define Ng = 1. The constants J;, Ki, N; are 
constrained by the canonical commutation or anticommutation relations to obey 
KiNi _ 1 
J? Qx)32E 


(i = B, F). (3:5) 


Using the above, one can express the single-particle expectation value of the quan- 

tum mechanical probability density as 

KL? 

Pi = On)? 

The conventions employed in this book, together with the implied normalization 
for boson or fermion single-particle states, are 


(i = B, F). (3.6) 


Lg=Lr=Ng=Np=l, Jg = Jp = Kg = Kp = 2E (27), 
(p’, s|p, r) = 2Epô;s (27) 8® (p' — p), (3.7) 


where r,s are spin labels. This choice, although somewhat unconventional for 
fermions,! has the advantages that bosons and fermions are treated symmetrically 
throughout the formalism, the zero-mass limit presents no difficulty, and matrix 
elements are free of cumbersome kinematic factors. 


Lifetimes: 
From the decay law N (t) = N (0)e™"/7, the inverse mean life t™! is seen to be the 
transition rate per decaying particle, T = t™! = —N/N. For decay of a particle 


of energy E; into a total of n — 1 bosons and/or fermions, the S-matrix amplitude 
can be written in terms of a reduced (or invariant) amplitude Mg as 


n KL 
(FIS — 1i) = —i (27)t8® (p1 — p2- — pa) | [ ( i i 
= i 
n Pk 1/2 
= Fn Ooa pe So: ( ) Ma, a8 
i(21)*8 (pı — pr æ» [| 2E, Nx yee 


where the index k labels the individual particles as to whether they are bosons or 
fermions. The inverse lifetime is computed from the squared S-matrix amplitude 
per spacetime volume VT and incident particle density p1, integrated over final- 
state phase space. The choice of phase space is already fixed by our analysis. Thus, 


! Another book sharing this convention is [ChL 84]. 
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defining a parameter of choice A(p) for the (momentum) phase space per particle, 


Ph ticl Í A (3.9) 
ase space per particle = : 
pace per p A(k)’ 
the application of completeness to Eq. (3.7) yields 
1 ak 1 2 3 
PIP =j -P Ik)(klp) => A= KL = 2ry p. (3.10) 
The inverse lifetime (or decay width) is then given by 
1 1 T Ëp \ IS-14 
zl EE I] = | lf 
pi Z r (2T) Pk VT 
d? pr 
a Onr ey Hs =p Malz, 
=a = -J (i Ar | 27'S p py a 
(3.11) 


where Z = IT; nj! is a statistical factor accounting for the presence of n ; identical 
particles of type j in the final state, and the sum ‘int’ is over internal ieee of 
freedom such as spin and color. 


Cross sections: 
For the reaction 1 + 2 —> 3 + ...n, the cross section o is the transition rate per 
incident flux. The incident flux finc can be represented as 


Pip2 


Fine = Prpal¥s — Val = 3 [Pr - pr) — mim’, (3.12) 
and the cross section becomes 
d 1 
4 ((pi- P? — mm2)? 
x J (i aim) (277)*8*(pi + po — +++ — Pn) $ Mal. 6.13) 
eng CT) LEN fai 


Watson’s theorem: 

The scattering operator S is unitary, S'S = 1. Thus, the transition operator T, 
defined by S = 1 — iT, obeys i(T — TŻ) = T'T. With the aid of the relation 
(f|T |i) = (i|T|f)*, we obtain the unitarity constraint for matrix elements, 


i (Ti — Tf) = Do TaT afi (3.14) 
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where Ta = (f|7|i). This constraint implies the existence of phase relations 
between the various intermediate-state amplitudes. For example, consider a weak 
transition followed by a strong final-state interaction for which there is a unique 
intermediate state identical to the final state, 

A — BC — BC, (3.15) 


weak strong 


i.e., i = A,n = f = BC. In this circumstance, time-reversal invariance of the 
hamiltonian implies 7i = Ti, so the left-hand side of the unitarity relation reduces 
to —2Im7i¢ and both sides of Eq. (3.14) are real-valued. Denoting the weak and 
strong matrix elements as |T,,|e’°” and |7;|e!°s, it then follows that ôw = 4,. 


C-—4 Field dimension 


We consider a limit in which the theory is invariant under the set of scale trans- 
formations x“ — Ax” (A > 0) of the spacetime coordinates. Associate with each 
such coordinate transformation a unitary operator U (à) whose effect on a generic 
quantum field ® is given by U(A)®(x)U'(A) = å% ®(Ax), where dg is the dimen- 
sion of the field ®. From the canonical commutation relation obeyed by a boson 
field g or the canonical anticommutation relation obeyed by a fermion field Yg, 


(00, x), O= — | ¥al0.¥), WhO] = dup), AD 


it follows that the canonical field dimensions are dp = 1 and dy = 3/2. Com- 
posites built from products of these fields carry a dimension of their own, e.g., 
all fermion bilinears YT y (T is a 4 x 4 matrix) have canonical dimension 3. 
Unless protected by some kind of algebraic relation, a field dimension will gen- 
erally be modified from the canonical value by interaction-dependent anomalous 
dimensions. Field dimensions are particularly useful in ordering the terms con- 
tained in a short-distance expansion, 


A(x)BO) —> J Jen(a) On, (4.2) 


where A, B, O, are local quantum fields. From the scale invariance of the short- 
distance limit, it follows that c,(x) ~ x4n—¢4—48, Thus, the fields O, of lowest 
dimension have the most singular coefficient functions. 


C-5 Mathematics in d dimensions 


Dirac algebra: 
The following set of rules, generally referred to as NDR (naive dimensional regu- 
larization), is the one most commonly used in the literature. We employ a metric 
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guv Corresponding to a spacetime of continuous dimension d and maintain certain 
d = 4 properties of the Dirac matrices such as the trace relations of Eq. (2.6). In the 
following, I4 is a diagonal d-dimensional matrix with Tr I4 = 4 and € = (4—d)/2. 


gp =d 
(His Yv} = 28uvla 
Yuy =d Ig 


Yu py” = Qe - 2) p 
Yu bp y" =4p-qla— 2e pg 
Mw Pty’ =—2fq pt pg +t 
pht + thp =2p-qt+2q-rp—2p-ry 
{Yu ys} = 0. (5.1) 


Note that in NDR, ys anticommutes with the gamma matrices. This will suffice 
for the calculations appearing in this book, but is not valid for all amplitudes (e.g. 
closed odd-parity fermion loops). 


Integrals: 
For the following integrals, we define the denominator function 


D = mx + m4 — x) — q°x(1 — x) — ie, (5.2) 
take nı, n2 > 1, and denote ie as the infinitesimal Feynman parameter. 
d‘p 1 
2x)! [(p —q)? — mj + ie]"' [p? — m3 + ie]”” 
i T(n +m —d/2) r x" — x)! 


7 = = 3 5.3 
E (4m) Tr) o  Dmtn—a72 (5.3a) 
d!p p” 
(27)! [(p = q)? _ m? as ie|" [p? _ m? rs Pia 
i Tin, +m — d/2) l x" (1 — x)” ! 
En ——_, b 
! ) (4m )1/? r (ni)I (n2) 0 x Drı+nz—4/2 ( ) 
d'p p" p° 
(27)! [(p = q)? = m? d ieļ" [p? _ m2 4 a 
i (—1)"+”2 r f atiga 
a paa r d 2 J xr =x 
(47)4/? Pm) Pn) [a aa 2 | Ae an 
pv 1 ree ol = eye! 
=A ate 1 —d/2) i dx pmm- d2 5 (5.3c) 
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d“p p“ p” p* 
2r)! [(p-q)+ie]" [p? — mj + ie]”” 
i (—1)71 +2 x™+2(1 = a ae 


1 
= haved _ 
= GaP Faora E qqVin+m a» f dx oE 


1 7 P n 1 x" (1 =a)" 1 
=z 04 gg tg a +e "a Hrm +m —1—4// Cea |? 


(5.3d) 
d’ p p” p” pèp? 
2r)! [(p-q)} -m +ie]” [p? — m3 + ie] 


rd = x)"27 1 


i pe V A0 
~ Um T n (na) grag g Ta + moda pmd 


1 PE aes xmithy — x)” 1 
E gg? +24" vajran i= pe al E eee ae 


PET uv g pr gtk Ho er 4+ 2—d/2 a yrz Md — x27 l 
PAG g g (nı +m — / a X pnma da | 
(5.3e) 
Solid angle: 
T Hes kg 27 271/2 
Qa = | d@q_-\ sin‘ 64-1 ea | do sin f do, = ‘ 
0 0 0 r(d/2) 
Q = 27, 23 =4r, Qy=2m’,.... (5.4) 


Gamma, psi, beta, and hypergeometric functions: 


ra=] dt e™ t! (Rez > 0), 
0 


N(zt+t)=2@=2z2-DFz—-1)=:::= 2), 
(=)! 


n! 


r (=n +€) = E +y(n+1)+ oo| (n integer), 
dI (z)/dz =V(z)w(z) where w(z+ 1) = y (z) + 1/z, 
WO) == = lim (1+3 be È- Inn) = -0.5772 
dy(z + 1)/dz = Y" (z + 1) = Y") — 1/2? with Y'(1) = x?/6, 


Term). fe Pe 
Bz, y= ee =f dt a pare (Re z, Re w > 0), 
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r(c) l b-1r1 _ pe-b-1 =ä 
mnl dtt™ (1—t) (1 — zt) 


(Rec > Reb > 0), 
T(c) (ec —a—b) 
l(c — a) (c — b)’ 


dF(a,b;c;z) ab 
de = i F(a+1,b+1;c+1;z). (5.5) 


F (a,b; c; z) = 


F(a, b; c; 1) = 
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